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TRANSLATOR’S PREFACE 


STUDENTs of what is usually termed “ General Relativity ” will need no reminder 
of the importance of Academician V. Fock’s work on their subject. Up to now 
very little of this work has been available in English and it is a privilege to help 
to make it accessible to the English-speaking world. I am myself no specialist 
in this field and when asked to undertake the translation I approached the 
project with quite a measure of doubt. 

However, even the first cursory reading of Professor Fock’s book convinced 
me that the task would be not too difficult and both rewarding and pleasurable. 
So it proved to be. 

Professor Fock’s clear and precise style of exposition made my task an easy 
one. I found that I could translate almost literally, in striking contrast to some 
of my experience of Russian scientific writing. The only sections of the trans- 
lation, and there were not many, in which the Russian original is not followed 
almost line by line, are those in which Professor Fock himself provided an 
amended text, which, incidentally, he did in English so near perfect as to require 
practically no correction. 

There are a number of reasons why my task was rewarding. The earlier parts 
of the book give an exposition of so-called “‘ Special” Relativity the merits of 
which I am more competent to judge than that of the later chapters. I found 
that I was transmitting a presentation of the greatest possible clarity and 
completeness with many an original turn to it, even in a field so often presented 
before. In subsequent chapters I naturally found more that was new and by the 
time I got to Professor Fock’s principal personal contributions to Relativity I 
found myself being guided through a most stimulating new field of thought with 
even the most lengthy and detailed calculation presented in a manner that 
never obscured the main argument. Where Professor Fock’s views are in dis- 
agreement with the opinions of others he states his case well—and his opponents 
have a formidable case to answer. 

The pleasure of my task lay, of course, partly in its scientific interest to which 
I have just referred, but, in addition, the work has brought me into a most happy 
contact with a great figure in theoretical physics and it gives me much satis- 
faction to testify to the ease of collaboration with Professor Fock and to the 
establishment of what I feel to be a strong personal bond. May it be a token of 
the greater bond of friendship between the scientists and between the peoples 
of our two countries. 

N. KEMMER 


September 1958 


TRANSLATOR’S PREFACE TO SECOND EDITION 


It has been a further pleasure to help in producing this second English edition 
of Professor Fock’s book. I acknowledge with gratitude the substantial help 
given to me by Professor Fock himself in providing a complete record of all 
changes made in the original Russian text. But for this aid the work of trans- 
lation would have been a much more lengthy one. 

N. KEMMER 


September 1962 


PREFACE 


THE aim of this book is threefold. Firstly, we intended to give a text-book on 
Relativity Theory and on Einstein’s Theory of Gravitation. Secondly, we wanted 
to give an exposition of our own researches on these subjects. Thirdly, our aim 
was to develop a new, non-local, point of view on the theory. and to correct a 
widespread misinterpretation of the Einsteinian Gravitation Theory as some 
lind of general relativity. 

The Russian edition of this book* was published in 1955. In the present 
English edition only very few changes are made : some mathematical proofs are 
altered (Sections 8 and 31) and some reasonings are made more precise (Sections 
61 and 93). 

It is hoped that the publication of the present edition, by making the book 
accessible to English-speaking readers, will contribute to a wider discussion of 
the author’s ideas on Gravitation Theory. 

In conclusion the author wishes to express his thanks to Professor N. Kemmer 
for his excellent translation. 


August 1958 V. Focx 


Physical Institute of the 
Leningrad University, 
US.S.R. 


PREFACE TO SECOND EDITION 


Ir 1s a pleasure to me that my book has received sufficient attention to make it 
necessary to publish a second edition soon after the first. This edition follows 
closely the second Russian edition published in 1961. 

The second edition differs from the first by some additions and reformulations. 
The question of the uniqueness of the mass tensor is treated in more detail 
(Section 31*) and is illustrated by two examples (Appendices B and C). The 
notion of conformal space is introduced and used as a basis for the treatment 
of Kinsteinian statics (Sections 56 and 57). Greatest care has been applied to 
the formulation of the basic ideas of the theory and to the elucidation of those 
points on which the author’s views differ from the traditional (Einsteinian) ones. 
Thus, in order to discuss the general aspects of the relativity principle Section 
49* has been added. 

The author’s views on the theory are explicitly formulated in different parts 
of the book and are implicit in the reasoning throughout the whole text. Their 
general trend is to lay stress on the Absolute rather than on the Relative. 
The basic ideas of Einstein’s Theory of Gravitation are considered to be: (a) the 
introduction of a space-time manifold with an indefinite metric, (b) the hypo- 
thesis that the space-time metric is not rigid but can be influenced by physical 
processes and (c) the idea of the unity of metric and gravitation. On the other 


* B.A. Dox, Teopua npocrpaHcTsa, BpCMeHM M TATOTEHMA, TocyjapcTBennoe l3f,aTeIbCTBO 
TeXHMKO-TeopeTH4eCKoli IMTepaTypH, Mockaa, 1955 


xi 


hand, the principles of relativity and of equivalence are of limited application 
and, notwithstanding their heuristic value, they are not unrestrictedly part of 
Eimstein’s Theory of Gravitation as expressed by the gravitational equations. 
The author expresses once more his thanks to Professor Kemmer for his 
excellent translation and considers his efforts as a token of friendship. 


Leningrad, June 1962 V. A. Fock 


INTRODUCTION t+ 


From the geometrical point of view the theory of space and time naturally 
divides into the theory of uniform, Galilean, space and the theory of non- 
uniform, Riemannian and Einsteinian, space.t 

Galilean space is of maximal uniformity. This means that in it: 

(a) All points in space and instants in time are equivalent. 

(b) All directions are equivalent, and 

(c) All inertial systems, moving uniformly and in a straight line relative to 
one another, are equivalent (Galilean principle of relativity). 

The uniformity of space and time manifests itself in the existence of a group 
of transformations which leave invariant the four-dimensional distance or 
interval between two points. The expression for this interval plays an important 
part in the theory of space and time because its form is directly related to the 
form taken by the basic laws of physics, viz. the law of motion of a freé mass- 
point and the law of propagation in free space of the front of a light wave. 

The indications (a), (b) and (c) of the uniformity of Galilean space are related 
to the following transformations: 

(a) To the equivalence of all points and instants corresponds the transforma- 
tion of displacing the origins of the spatial coordinates and of time; the 
transformation involves four parameters, namely, the three space 
coordinates and the time coordinate of the origin. 

(b) To the equivalence of all directions corresponds the transformation of 
rotating the spatial coordinate axes; this involves three parameters, the 
three angles of rotation. 

(c) To the equivalence of inertial frames corresponds a change from one 
frame of reference to another moving uniformly in a straight line with 
respect to the first; this transformation involves three parameters, the 
three components of relative velocity. 

The most general transformation involves ten parameters. This is the 

Lorentz transformation. 

It is well known that in a space of dimensions the group of transformations 
which leave invariant the expression for the squared distance between infinitely 
near points, can contain at most 4n(n + 1) parameters. If there is a group 
involving all 4n(n + 1) parameters then the space is of maximal uniformity; 
it may be a space of constant curvature, or, if the curvature vanishes, a 
Euclidean or pseudo-Euclidean space. 


+ In this introduction we frequently use terms and concepts which will be defined precisely 
only in the subsequent text. This is somewhat inconsistent but, unfortunately, unavoid- 
able, for it is our aim to give in this introduction an outline of our attitude to the theory 
discussed in this book. Since, however, we hope that the intending reader has some prior 
knowledge of the subject we feel no great inconvenience will be caused by this manner of 
presentation. If a reader does not fully understand what is said, he will be well advised to 
return to the introduction after reading the book. 


t We shall often use the word “space” short for “ space and time”. 
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In the case of space-time the number of dimensions is four and therefore the 
greatest possible number of parameters is ten. This is also the number of 
parameters in the Lorentz transformation, so that Galilean space, to which the 
transformation relates, is indeed of maximal uniformity. 

It is customary to call the theory based on the Lorentz transformations the 
Special Theory of Relativity. More precisely, the subject of that theory is the 
formulation of physical laws in accordance with the properties of Galilean 
space. The creator of the theory of relativity was Albert Einstein (1879-1955). 
Poincaré and Lorentz should be considered his predecessors. In this book 
Chapters I-IV are devoted to the theory of Galilean space. 

Universal gravitation does not fit into the framework of uniform Galilean 
space. The deepest reason for this fact was given by Einstein. It is that not 
only the inertial mass, but also the gravitational mass of a body depends on 
its energy. 

It proved possible to base a theory of universal gravitation on the idea of 
abandoning the uniformity of space as a whole} and attributing to space only a 
certain kind of uniformity in the infinitesimal. Mathematically, this meant 
abandoning Euclidean, or rather pseudo-Euclidean, geometry in favour of the 
geometry of Riemann. The modern theory of gravitation was also created by 
Einstein. 

According to gravitational theory there still exists in the infinitely small a 
uniformity like the one expressed by Lorentz transformations. That this is so 
is connected with the fact that in the vicinity of a given point of space the gravi- 
tational field can be imitated by a field of acceleration (the so-called Principle 
of Equivalence). The physical basis of this is the law already known to Galileo 
that in the absence of a resistive medium all bodies fall equally fast, or, more 
accurately, with equal acceleration. The law of Galileo can be stated in general- 
ized form as the law of the equality of inertial and gravitational mass. It 
should be stressed that this fundamental law is of a general character whereas 
the principle of equivalence is strictly local. The law of the equality of inertial 
and gravitational mass applies to the extent to which it is possible to define 
inertial and gravitational properties of mass as attributes of a particular body, 
independently of the positions and motions of other bodies. (Such indepen- 
dence exists in Newton’s theory of gravitation, but does not necessarily hold 
in generalizations of that theory). 

The law of equality of inertial and gravitational mass is the simple physical 
basis on which a theory of gravitation can be constructed; of course, the very 
process of constructing a theory involves introducing new generalizations. As a 
result of this the equality law may acquire the status of an approximate law, 
but this does not detract from its fundamental character. As a basis for con- 
oo he? sek sa ornare beoine a whole en conditions at infinity ”, etc., not literally, 

: pted in field theory. By “‘ space as a whole” we mean a 
Ba ae ee = sree ae to the systems we are considering is 
dimensions of the region can differ ae ee ae ena: a see 
an atom or a molecule a distance of the order of @ micron can be ete) ar saegel cee) 
for the solar system, a light year, for the galactic system, a thousand mi fas : . itely great, 

, : on light years. We 


never use se space asa whole ” to denote the Universe. A consideration of the Universe seems 
to us impossible for gnosiological reasons. 
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structing a more general theory, the law of equality of inertial and gravitational 
mass has the following advantage over the principle of equivalence: in the 
first place it is not confined to local considerations, in the second it avoids 
dealing with frames of reference in accelerated motion—the precise definition 
of which is not easy to make. 

In order to construct a theory of gravitation or to apply it to physical 
problems it is, however, insufficient to study space and time only locally, i.e. 
in infinitely small regions of space and periods of time. One way or another 
one must also characterize the properties of space as a whole. If one does not 
do this, it is quite impossible to state any problem uniquely. This is particularly 
clear in view of the fact that the equations of the gravitational, or any other 
field, are partial differential equations, the solutions of which are unique only 
when initial, boundary or other equivalent conditions are given. The field 
equations and the boundary conditions are inextricably connected and the 
latter can in no way be considered less important than the former. But, in 
problems relating to the whole of space, the boundary conditions refer to 
distant regions and their formulation requires knowledge of the properties of 
space as a whole. 

One should note that Einstein did not fully appreciate the inadequacy of a 
local description and the importance of boundary conditions. This is why it is 
necessary to change substantially Einstein’s statement of the basic problems 
of gravitational theory; this has been done in the author’s research and in 
this book. 

The simplest and at the same time the most important case is when one can 
assume space to be uniform at infinity in the sense of the Lorentz transforma- 
tion. In this case the non-uniformities arising from material bodies will have 
local character. Masses and their gravitational fields will be, so to speak, 
embedded in unlimited Galilean space. This case is particularly important 
because the existence of integrals of motion is connected with the uniformity 
of space at infinity. Only if space at infinity allows the full Lorentz trans- 
formation with ten parameters do there exist the ten integrals of motion, 
including the energy integral. 

Chapters V, VI and VII of this book are almost solely devoted to the case 
of a space uniform at infinity. 

The assumption is also possible that space-time as a whole is not completely, 
but only partially, uniform; viz., that as before an arbitrary shift of the origin 
of space coordinates and an arbitrary rotation of axes are permissible, giving 
six parameters, but that four parameters of the Lorentz transformation, i.e. the 
three components of relative velocity and the origin of time reckoning, are 
fixed in terms of the first six. Such a space-time was first considered by 
Friedmann and because the spatial part of it obeys Lobachevsky’s geometry, 
it may be called the space of Fnedmann—Lobachevsky. In contrast to Galilean 
space, this space allows a well-defined gravitational field to exist when the mean 
density of ponderable matter is different from zero. For this reason, one can 
surmise that when in cosmology one considers enormous regions, thousands of 
millions of light years in extension and approximately uniformly filled with 
galaxies, the space of Friedmann-Lobachevsky will be a better approximation 
to reality than Galilean space. The theory of local non-uniformities in 
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Friedmann-Lobachevsky space has hitherto not been developed, and we devote 
only Sections 94-95 of this book to this space. Other assumptions concerning 
the properties of space as a whole are also possible, but we shall not discuss 
them. 

For the question of whether preferred systems of coordinates exist, the 
properties of space as a whole are also of primary importance. 

In Galilean space the familiar cartesian coordinates together with time, are 
preferred. This set of variables is called the set of Galilean coordinates. The 
privileged position of these coordinates arises from the fact that in terms of 
them the Lorentz transformations, which express the uniformity of space, 
prove to be linear. 

In the case of a space uniform only at infinity it also proves possible to 
introduce a preferred system of coordinates, which is well defined up to a 
Lorentz transformation: they are the harmonic coordinates. This fact, first 
proved in the author’s work, has great importance in matters of principle. 
Only with its support can one show in particular that Einstein’s gravitional 
theory retains for the heliocentric Copernican system a preferred position 
compared to the geocentric Ptolomaic one. A more detailed exposition of this 
is given in Sections 92-93. All concrete problems in gravitational theory 
discussed in this book are solved in harmonic coordinates. This ensures that 
the solutions are unique. 

It is probable that in Friedmann-Lobachevsky space there also exist some 
preferred systems of coordinates. However, this question has not been investi- 
gated; even the theory of local non-uniformity in this space has not as yet 
been formulated. 

On the question of the existence of privileged systems of coordinates Einstein, 
the founder of gravitational theory, maintained a point of view opposite to 
ours, denying their existence in all cases. This is connected with his afore- 
mentioned preference for the local method of discussing properties of space 
(which method is the basis of Riemannian geometry), and his underestimation 
of the importance of considering space as a whole. Undoubtedly the philo- 
sophical attitude of Einstein, influenced throughout his life by the ideas of 
Mach, played its part in this. 

An important place in the theory of space and time is taken up by the question 
of different coordinate systems and the change in the appearance of equations 
on going from one such system to another. 

Great importance is usually attached to the covariance of equations. By 
this is meant the following: We consider a transformation of the coordinates 
accompanied by a transformation of dependent variables, or functions, according 
to some specified rule, for instance, the tensor rule. We examine the appearance 
of the equations satisfied by the original and the transformed functions. If, as 
& result of the transformation, the new functions of the new variables satisfy 
equations of the same appearance as the old functions of the old variables, the 
equations are called covariant. Covariance permits one to write down equations 
without presupposing any particular coordinate system. In addition, the 
Tequirement of covariance of equations has great heuristic value because it 
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limits the variety of possible forms of equations and thereby makes it easier 
to choose correct ones. However, one should stress that the equations can so 
be limited only under the necessary condition that the number of functions 
introduced is also limited; “if one permits the introduction of an arbitrary 
number of new auxiliary functions, practically any equation can be given 
covariant form. 

Thus, covariance of equations in itself is in no way the expression of any 
kind of physical law. For instance, in the mechanics of systems of mass-points 
Lagrange’s equations of the second kind are covariant with respect to arbitrary 
transformations of the coordinates, although they do not express any new 
physical law compared to, for instance, Lagrange’s equations of the first kind, 
which are stated in cartesian coordinates and are not covariant. 

In the case of Lagrange’s equations covariance is achieved by introducing as 
new auxiliary functions the coefficients of the Lagrangian considered as an 
expression quadratic, but not necessarily homogeneous in the velocities. 

In Riemannian geometry the new functions at our disposal are the coefficients 
Ju, of the quadratic form for the squared infinitesimal distance. The law 
according to which these functions transform in passing to a new coordinate 
system follows from their definition as coefficients of a quadratic form, together 
with the condition that this form is an invariant; in the following we shall 
always assume that a transformation of the coordinates is accompanied by a 
transformation of the y,, according to this law. The set of quantities g,,, is 
called the metric tensor. 

Having introduced the metric tensor, one can form expressions that are 
covariant with respect to any coordinate transformation. If we consider only 
those metric tensors which are obtainable from a particular one (e.g. from the 
Galilean tensor) by coordinate transformation, this will give us nothing new 
except the fact that our equations are covariant. But if we extend the dis- 
cussion to metric tensors of a more general form, tensors which cannot be 
transformed into one another by coordinate transformations, the generalization 
so obtained is an essential one. In this case the metric tensor will express not 
only properties of the coordinate system but also properties of space, and these 
latter can be related to the phenomenon of gravitation. It is Einstein’s idea 
that the description of the gravitational field demands the introduction of no 
functions other than’the metric tensor itself. This idea of the unity of metric 
and gravitation is the very basis of Eimstein’s theory of gravitation. It leads 
. almost uniquely to the equations of the gravitational field found by Einstein. 

Having clarified the concept of covariance as applied to Riemannian geo- 
metry, let us now consider it together with the previously discussed concept of 
the uniformity of space. As was shown above, the property of uniformity in 
Galilean space manifests itself in the existence of transformations that leave 


unchanged the expression for the four-dimensional distance between two points. 
More precisely, these transformations leave unchanged the coefficicnts of this 


expression, i.e. the quantities g,,. If the g,, are functions of the coordinates we 
mean by this that the mathematical form of these functions is unchanged: the 
dependence of the new y,, on the new coordinates has the same mathematical 
form as that of the old g,, on the old coordinates. In the gencral case of 
Riemannian geonictry there are no transformations that leave they, unchanged 
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because Riemannian space is not uniform. In Riemannian geometry one deals 
with transformations of coordinates accompanied by transformations of the g,, 
and neither such a combined transformation nor covariance with respect to it, 
has any relation to the uniformity or non-uniformity of space. 

These statements can be illustrated by a simple example. 

Let us consider the surface of a sphere. This surface can be transformed into 
itself by means of rotations by any angle around any axis passing through the 
centre. Such a transformation involves three parameters, but since the surface 
of the sphere is two-dimensional it follows from the remarks at the beginning of 
this introduction that it will be of maximum uniformity. On the surface of a sphere 
there are no preferred points or preferred directions. 

Let us now consider a more general non-spherical surface of revolution. It also 
can be transformed into itself by rotation about an axis, but now the direction 
of the axis is fixed and only the angle of rotation remains arbitrary, so that the 
allowed transformations involve only a single parameter. Certain points on the 
surface will be singled out (the poles through which the axis passes) and so will 
certain directions (meridians and latitude circles). Finally, if we consider a surface 
of general form, there will be no transformations taking it into itself. Thus the 
sphere, a surface of a very particular form, possesses the greatest degree of 
uniformity (3 parameters), a surface of revolution has partial uniformity (1 para- 
meter), while a surface of general form possesses no uniformity whatsoever. 

We see that the generality of the form of the surface is a concept antagonistic 
to the concept of uniformity. This conclusion remains valid for the geometry of 
the 4-dimensional space-time manifold: the generality of its geometry is an- 
tagonistic to its uniformity. 

We shall now clarify the meaning given to the concept of relativity in the 
theory of space and time and will show that it is extremely closely related to 
the concept of uniformity. 

When speaking of the relativity of a frame of reference or simply of relativity, 
one usually means that there exist identical physical processes in different 
frames of reference (for more detail see Sections 6 and 49*). According to the 
generalized Galilean principle of relativity identical processes are possible in all 
inertial frames of reference related by Lorentz transformations. On the other 
hand, Lorentz transformations characterize the uniformity of Galilean space- 
time. Thus the principle of relativity is directly related to ywniformity. This 
also shows that the nomenclature introduced in Einstein’s first papers, by 
which the theory of uniform Galilean space is named “ Theory of Relativity ” 
can to some extent be justified.} 

In the following we shall see that relativity in the sense defined above is 
related to uniformity in all those cases in which the space-time metric can be 
considered fixed. This can be done not only in the theory of Galilean space, 
where the metric is given once and for all, but also in the theory of Riemann 
and Einstein space, provided only the processes under discussion have no prac- 
tical influence on the metric. In these cases relativity is uniquely related to 
uniformity : in uniform Galilean space it exists, innon-uniform Riemannian space 
it is absent. If, however, the discussion is to include processes which themselves 
essentially influence the metric (displacement of heavy masses) it will be shown 


t These ideas were expressed by E. Cartan [1]. 
¢ The name Chronogeometry suggested by A. D. Fokker [la] would be more appropriate. 
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at the end of the book (Section 93) that relativity can to a certain extent be 
retained, even in inhomogeneous space, albeit only in those cases in which the 


non-uniformity produced by heavy masses may be treated as a local perturbation 
in infinite Galilean space. 


The preservation of a certain degree of relativity, even in non-uniform space, 
is due to the fact that in going over to a new frame of reference the previous 
gravitational field is replaced by a new one, which has the same form in the new 
frame as the old in the old frame. Thus, if a laboratory on the Earth is turned 
upside down, the processes in it will be disturbed (there is no relativity) but if 
the reversal is accompanied by a parallel transport of the laboratory to the anti- 
podes, the course of all processes in it will be the same as in the beginning (rela- 
tivity is maintained). 

Thus, even in non-uniform space the still existing relativity is indirectly 
connected with uniformity, namely with the uniformity at infinity if one dis- 
cusses relativity “in the large”, and with local uniformity in a geodesic co- 
ordinate system if one is discussing relativity ‘“‘in the infinitely small”. We 
have seen, however, that uniformity stands in an antagonistic relation to the 
degree of generality of the geometry ; this is why relativity either does not exist 
at al] in a non-uniform space with Riemannian geometry, or else it does exist, 
but does not go beyond the relativity of Galilean space. But there can be no 
question of a generalization of the concept of relativity in going over to non- 
uniform space. 

However, when Einstein created his theory of gravitation, he put forward 
the term “ general relativity’? which confused everything. This term was 
adopted in the sense of “ general covariance ”, i.e. in the sense of the covariance 
of equations with respect to arbitrary transformations of coordinates accom- 
panied by transformations of the g,,. But we have seen that this kind of 
covariance has nothing to do with the uniformity of space, while in one way or 
another relativity is connected with uniformity. This means that “ general 
relativity ” has nothing to do with “ relativity as such”. At the same time the 
latter received the name “ special” relativity, which purports to indicate that 
it is a special case of “ general ”’ relativity. 

To show what misunderstandings resulted we shall consider some examples. 

As will be shown in Chapter IV, the theory of uniform Galilean space can be 
formulated not only in a way covariant with respect to Lorentz transformations, 
but also in a generally covariant manner. In the language of “ general and 
“ special ” relativity it would be extremely difficult to express this simple idea, 
and we shall not attempt to do so, for we would have to say that “ special ” 
relativity includes “ general ” relativity or something of that kind. 

Remembering that even in Newtonian mechanics one deals with the generally 
covariant Lagrange equations of the second kind, one would also have to say 
that Newtonian mechanics contain in itself “ general ” relativity. 

It has become a convention, since Einstein, to use the term “ general ” 
relativity or “ the general principle of relativity ” im the sense of " theory of 
gravitation”. Einstein’s fundamental paper on the theory of gravitation (1916) 
is already entitled “ Foundations of the General Theory of Relativity ” This 
confuses the issue still further because the words “‘ general ” and “ relativity 7 
are not used with their proper meaning. In the theory of gravitation, space is 
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‘ 


assumed non-uniform whereas relativity relates to uniformity so that it appears 
that in the general theory of relativity there is no relativity. 

Enough has been said to make clear that the use of the terms “ general 
relativity ”, “ general theory of relativity ’’ or “ general principle of relativity ” 
should not be admitted. This usage is inconvenient because it not only leads 
to misunderstanding, but also reflects an incorrect understanding of the theory 
itself. However paradoxical this may seem, Hinstein, himself the author of 
the theory, showed such a lack of understanding when he named his theory and 
when in his discussions he stressed the word “ general relativity ”’, not seeing 
that in the new theory he had created, the notion of relativity was not among 
the concepts subjected to generalization. 

In the present book we do not use the term “ general relativity ”. Following 
established practice, we call the theory of Galilean space the Theory of Relativity, 
but without the adjective “ special’. We call the theory of Einstein space the 
Theory of Gravitation, not the ‘general theory of relativity’, because the 
latter name is nonsensical 


‘ 


*x * * * 


The philosophical side of our views on the theory of space, time and gravi- 
tation was formed under the influence of the philosophy of dialectical 
materialism, in particular under the influence of Lenin’s “ Materialism and 
Empiriocriticism”, The teachings of dialectical materialism helped us to 
approach critically Hinstein’s point of view concerning the theory created by 
him and to think it out anew. It helped us also to understand correctly, and to 
interpret, the new results we ourselves obtained. We wish to state this here, 
although this book does not explicitly touch philosophical questions. 


CHAPTER I 


THE THEORY OF RELATIVITY 


. Coordinates of Space and Time : 


Space and time are primary concepts. In the philosophical sense they are 
the forms of existence of matter. This can be understood as meaning that the 
concepts of space and time are obtained by appropriate abstractions from the 
concept of spatio-temporal relations between material processes. The simplest 
notion relating to space and time is a point in spacc considered at a particular 
instant in time. To ‘‘ mark” a point in space one must place there a material 
body of sufficiently small dimensions. The position of this body can only be 
fixed relative to other material bodies since there does not exist any cartographic 
grid inherent in space and independent of material bodies. When a frame of 
reference, or “ base”, is chosen, i.e. a system of material bodics relative to 
which the position of the given point body is measured, that position can be 
fixed in terms of threc coordinates relating to a particular instant in time, 
reckoned according to a clock at the base. Generally speaking coordinates are 
auxiliary quantities characterizing the disposition of bodies relative to the frame. 
They allow one to calculate according to the laws of Euclidean geometry the 
relative positions of the bodies, in particular their distances apart and the angles 
between the directions of lines joining them. Usually one takes the coordinates 
to be rectilinear and orthogonal because such coordinates are simplest to relate 
to lengths and distances, but any other (curvilinear) coordinates are admissible, 
for instance, two angles defining the direction towards a point-body and the 
distance to it. 

One should stress that both the coordinates themselves and the distances, 
angles, etc., calculated from them in describing the relative positions of bodies 
have a definite meaning only when a particular base is postulated. In the same 
way instants in time to which the coordinates and the distances refer, and 
likewise time differences, become well-defined only when a definite spatial base 
and a definite time-measure in that base are assumed, i.e. a definite frame of 
reference. 

In this way the variables z, y, z and t, the space and time coordinates with 
which we shall deal in the following, are related to a base. 


2. The Position of a Body in Space at a given Instant, in a Fixed 
Reference Frame 
Let us first disregard time and consider the usual methods for determining 
the situation of objects in space. The logical foundation of these methods is, 
in principle, the hypothesis that Euclidean geometry 1s applicable to real 
physical space together with two further assumptions, viz. that rigid bodies 
exist and that light travels in straight lines. In fact, to find the position of a 
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distant object one has to use a rigid graduated rule to map out a definite base 
in the sense of ordinary triangulation, and a light ray to take bearings on the 
object at different points of the base. Assuming that the light rays are straight 
lines the laws of Euclidean geometry then allow one to calculate the distance 
to the object and all other data characterizing its position. The rectilinear 
nature of light rays in vacuo is a basic postulate. In the atmosphere light rays 
are only approximately straight and corrections for refraction, etc., must be 
made. The validity of the laws of Euclidean geometry in the real physical 
world should be taken as an experimental fact, not as an a priori assumption. 
Indeed, although these laws can be verified with an enormous degree of accuracy, 
Einstein’s theory of Universal Gravitation is based on the consideration of 
small departures from them. 

Thus the properties of light and the properties of rigid bodies play a basic 
part in establishing the geometry of real physical space. 

However, one must note that in this connection the concept of a rigid body is, 
to some extent, auxiliary. Absolutely rigid bodies do not exist; one can take 
actual physical bodies to be rigid and to possess invariable geometric dimensions 
only as an approximation and only in certain conditions, namely at constant 
temperature, in the absence of elastic vibrations, etc. The invariability of a 
standard of length can be checked with greatest accuracy by comparing it with 
the wave length of some definite spectral line. Thus the notion of length ulti- 
mately depends on the properties of atoms or molecules radiating the spectral 
line and on the properties of light. 

Another method for determining the disposition of objects in space, which 
is in principle different from triangulation, is radiogeodesy or radar. Basically 
this method consists of emitting radio signals from a certain point and reflecting 
them from the object to be observed so that they return to the point of origin. 
The time elapsing during the passage of the signal there and back is noted 
and also, of course, the direction. Knowing the speed of propagation of a radio 
signal—it is in fact equal to the speed of light—one obtains the distance to the 
object as the product of the speed and half the time taken by the signal. 

As a matter of principle this method is important because it reduces the 
measurement of length to a measurement of time intervals and does not make 
use of the properties of absolutely rigid bodies. The essential assumption is 
‘that the speed of light is constant. Here this speed plays the part of a con- 
version factor from time to length. Its numerical value must be determined by 
other experiments and these do have to use a standard of length. 

In principle, a time measurement can be performed with the aid of any 
periodic process. At present the clocks of greatest precision rely on the proper 
vibrations of a quartz crystal or of an ammonia molecule. In practice astro- 
nomical time measurements are used which rely on applying Newton’s laws of 
motion to the rotation of the earth, with due observance of all deviations from 
uniform rotation that the theory predicts, such as nutation, etc. 

Such methods of time measurement make it possible to regulate a clock at 
the base. 

When measuring the position of moving bodies from a fixed base the question 
arises: to what instant, by the clock at base, does the measured value of distance 
or any other spatial coordinate refer? We accept the following definition: If 
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1e light signal (or the signal from the radar station) is sent off at a time 4 
nd returns at ¢,, then the value of the coordinates z, y, z of the object and also 
he distance r = c(t, — ¢,)/2, refer to the instant ¢ = (t, + ¢,)/2. This definition 
sin accord with the natural assumption that the speeds of light to and fro 
Te equal. 

We have shown that, starting from a definite base and equipped with gradu- 
ted rules, clocks and other necessary apparatus, we can measure out the 
osition of bodies in space with reference to the base and refer the resulting 
alues of coordinates, such as the cartesian ones 2, y and z, to a definite instant ¢, 
rhich also refers to a clock at base. In the following we shall call this kind of 
ase a reference frame. We shall still use the word ‘“‘base” in those cases when it 
| desirable to stress the fact already pointed out at the beginning of Section 1, 
amely, that a reference frame is not some kind of coordinate grid in space 
ogether with some “‘ universal time ”’, but is connected with rules and clocks 
rcated at a definite place and moving in a definite way. 


. The Law of Propagation of an Electromagnetic Wave Front 
The laws of propagation of light in empty space are thoroughly understood. 
‘hey find their expression in the well-known equations of Maxwell 


1 0E 
curlH — - — = Q, divE=0 
c at 


(3.01) 
1 oH : 
curlE + -—=0Q, divH = 0 
c at 


there E and H are the vectors of electric and magnetic field intensity. However, 
re are not interested in the general case of light propagation, but only in the 
ropagation of a signal advancing with maximum speed, i.e. the propagation 
f a wave front. Ahead of the front of the wave all components of the field 
anish. Behind it some of them are different from zero. Therefore, some of 
he field components must be discontinuous at the front. 

On the other hand, given the field on some surface moving in space, the 
erlvatives of the field on the surface are, in general, determined by Maxwell’s 
yuations. Hence the value of the field at an infinitely near surface is also 
etermined and discontinuities of the field are impossible, The only case when 
1is is not so is when the form and motion of the given surface satisfies certain 
declal conditions subject to which the value of the derivatives is not determined 
y the values of the field components themselves. Such a surface is called a 
iaracteristic surface or, briefly, a characteristic. Thus, discontinuities of the 
eld can occur only on a characteristic, but since there must certainly be dis- 
ontinuities at a wave front, such a front is clearly a charactenstic. 

Let us determine the equation of a characteristic for the system of Maxwell’s 
juations. 

Let the value of the ficld be given for those points and instants whose 
vordinates are related by the equation 


t= fly) (3.02) 


1 particular, if f = 0 this a:nounts to stating initial conditions. Equation (3.02) 
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may be looked upon as the equation of a certain hypersurface in the four-dimen- 
sional space-time manifold. When (grad f)?> 1 the same equation can be 
considered as the equation of an ordinary surface moving through space (see 
also Section 35). Assume that on the hypersurface (3.02) the values of a certain 
function u are given 


ua Y, 2, f) = U(2, ¥, 2) (3.03) 


The following combinations of derivatives with respect to space and time 
coordinates will thereby also be given 


du lew of AX, ; 

Gat al on oe (¢ = 1, 2, 3) (3.04) 
Here we have used 2,, %, £, for brevity instead of z, y, z. If, in addition, the 
derivative du/&t were given, the values of all the derivatives of u at the surface 
would be known. 

Let us take as the function u one of the components of electromagnetic field, 
for instance Ez, and denote by E® = E9(z, y, z) the value of this component 
on the hypersurface. We shall use the same notation (superfix 0) for the other 
components. If the field is given on the hypersurface, the whole set of quantities 
E° and H° can be considered as known functions of z, y and z. As in (3.04) we 
shall have for instance 


roy Dy 1 ok, @ ok? 
mses gS Oameainaeg of = — ete. (3.05) 
oz c a @ oz 


whence div E + . (era : 5) = Div E° (3.06) 
curl E + : jerad / x 3 = Curl E° (3.07) 
and also div H + . (era ip: =) — Div H° (3.08) 
curl H + - ered f x =| = Curl H® (3.09) 


In the last equations we have denoted the operators div and curl when applied 
to E° and H®° by Div and Curl. 

The values of the six functions E° and H® are not independent: they must 
satisfy two relations which we shall now give. 

Taking the scalar product of equations (3.07) and (3.09) with grad f and 
using Maxwell equations (3.01) we get 


(grad f - Curl E°) = — . (ena : =) (3.10) 


(grad f - Curl H°) = . (era . =) (3.11) 


Now the first terms on the left-hand sides of both equations (3.08) and (3.06) 
vanish by virtue of Maxwell’s equations and the second terms are the same as 
the right-hand sides of (3.10) and (3.11). We have, therefore, 


Div H® + (grad f:Curl E°) = 0 (3.12) 
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Div E° — (grad f- Curl H°) = 0 (3.13) 


These are the conditions which the given functions E° and H® must satisfy. 
In the cace that f = const., we have to choose initial values in the ordinary way 
and then the conditions amount to the obvious requirement that div E and 
div H must both vanish at the initial instant. 

Using Maxwell’s equations we can write (3.07) and (3.09) in the form 


dE oH 
aes jerad f x =| = c Curl H° (3.14) 
oH oE 

ie oh jerad f x 5 = ¢ Curl E° (3.15) 


Multiplying these equations vectorially by grad f and using the relations just 
deduced, we transform them into : 


OE 
{1 — (grad f)?} = = E° — (grad f -E°) grad f — [grad f x H®] (3.16) 


2: 
{1 — (grad f)"} = = H® — (grad f- H°) grad f + [grad f x E°] (3.17) 


Here we have written for brevity 
B°=cCurlH®; H° = —cCurlE° (3.18) 


The functions on the right-hand side of (3.16) and (3.17) are all known. 
These equations can be solved for the time derivatives if their coefficient, i.e. 
the quantity 1 — (grad f)?, is not zero. In that case equations analogous to 
(3.05) lead to finite values of all the other first derivatives of the field, so that 
the field itself will be continuous on the surface (3.02). If there is to be a 
discontinuity there, the coefficient of the time derivatives must thus necessarily 
be zero, giving the condition 


(grad f)? = 1 (3.19) 


A surface satisfying this equation will be a characteristic. If we write the 
equation of a surface without solving for time, but instead in the form 


w(z, ¥, 2, t) = 0 (3.20) 

the equation (3.19) for the characteristic takes on the form 
1 /Ga\?2 oe 39] 
= (3) — (grad w)? = (3.21) 


Therefore the propagation of an electromagnetic wave front satisfies this 
equation (3.21). 
Particular functions satisfying the equation (3.19) or (3.21) are 
t= (ox by ty) @ +B +yt=)) (3.22) 
e 


Lato te vile— a)? + yy) +20) (3.28) 


The former gives a plane wave front, the latter a spherical one. 
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4. Equations for Rays 
The equation describing the propagation of a wave front can be written in 
the form 


“ + cy/(w?2 + w2 + w?) = 0 (4.01) 
(for definiteness we have chosen the plus sign before the square root). The 
form of this equation is the same as that of the Hamilton-Jacobi equation in 
mechanics with w playing the part of the action function S and the derivatives 
@z, Wy and w,, the part of the “‘ momenta”, pz, py and pz. Corresponding to 
the Hamiltonian we have here the expression 
H = c/w? + of + 0?) (4.02) 
To the trajectories of mechanics there correspond light rays. The equations 
for them are analogous to Hamilton’s equations. They can be written as 


ee Oe ne (4.03) 
dt @w, /(w?, + 2 + w2) 
duz oH 
a PEO Ld oe : 4.04 
dt on °, sd oe 


Equation (4.04) shows that the quantities wz, oy and wz are constant along a 
tay, though they can, of course, vary from one ray to another. Therefore the 
rays will be straight : 
OF 2 

° Voz Fat $03) 

If the sign of w, and hence of wz, wy and wz is changed, the direction of the 
ray is reversed ; the sign must be chosen according to the given sense of direction 
of the ray. 

Any wave surface can be considered as formed of points moving along the 
rays with the speed of light according to (4.05). 

We thus have the possibility of constructing a wave surface at time t when 
its form at time ¢, is known. Let the equation of the wave surface at time ty 
have the form 


Z—I%y = 


(t — t,), ete. (4.05) 


W(X, Yo Zp) = 0 (4.06) 
where 2p, ¥) and z, are coordinates varying over this surface. Knowing the 
equation of the surface we can calculate the quantities 


OF 
X(Zo, Yo, 29) = Geman earn ’ etc. (4.07) 
Here the sign of the right-hand sides is determined by the given direction of 
wave propagation. The equation of the ray passing through the point (ao, ¥9, Zo) 
of the initial wave surface is 

Z— XL) = ca(t —?t,) 

Y—YoruP—t) (0? +8? +7? =1) (4.08) 

Z — 2% = ey(t — to) 
The quantities z, y and z give the positions of the point to which the point 
(Zo, Yo. Zo) Moves at time t. Allowing 2, y, and z, to take on all values which 


satisfy w° = 0, we obtain from (4.08) all points which at time ¢ lie on the 
wave surface. 
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If we solve (4.08) for xo, y, and 2 and insert the functions 


Lo = X(z, y, z, t — ty), etc. (4.09) 
into the wave surface equation w° = 0, we get the relation 
w(z, y, z,t —t,) = 0 (4.10) 


which is the explicit form of the equation of the wave surface at time t, At 
t = ty, obviously, 2) = 2, yy = y, 29 = z and equation (4.10) reduces to (4.06), 
which is the equation of the initially given wave surface. 
From the ray equation (4.05) there follows the relation 
o*(t — bo)? — [(a — a9)? + (y — yo)? + (2 — 2)*] = 0 (4.11) 
which connects the coordinates of the initial and final points on each ray. It 
is the equation of a sphere centred at the point 2, Yo) Zo and of a radius 
R = c(t — t,) that increases linearly with time. Just as equation (3.21), from 
which we started, this equation expresses the fact that the velocity of light 
propagation is constant. 
For points infinitesimally separated relation (4.11) takes on the form 
c? dt? — (dx® + dy? + dz®) =0 (4.12) 
In this form the equation follows directly from Hamilton’s equation (4.03). 
A more general treatment of the problem of integrating the wave front equa- 
tion is given in Appendix F. 


5. Inertial Frames of Reference 

In Section 2 we explained what is meant by a frame of reference and showed 
that it is essentially a certain base equipped with graduated rules and clocks ; 
it can be roughly represented as a kind of radar station. The important thing 
is that the base consists of certain material bodies that have some definite 
position in space and some definite motion. 

To describe any phenomenon, in particular propagation of light or motion of 
material bodies, requires a definite frame of reference or base. For example, 
the motion of the planets is normally described in the heliocentric system. 
The origin of this system, i.e. the position of the imaginary base, is the mass- 
centre of the solar system and the three axes are chosen to point to three fixed 
stars, by which choice the orientation of the base is fixed. 

In general, the mathematical form of the laws of nature will be different in 
different frames. Thus, for instance, the motion of bodies relative to the Earth 
may be described either in a frame with axes pointing to three fixed stars or in 
one rigidly fixed to the Earth and in the latter case one has to introduce Coriolis 
forces into the equations of motion. 

There exist frames of reference in which the equations of motion have a 
particularly simple form; in a certain sense these are the most “ natural ” 
frames of reference. They are the inertial frames in which the motion of a body 
is uniform and rectilinear, provided no forces act on it. (Here the question 
arises how to tell that no forces are acting on the given body; we shall take 
it that this is the case if all bodies capable of exerting forces are sufficiently 
distant.) The heliocentric frame is an inertial one to a very good approximation. 
In an inertial frame we shall almost always use cartesian coordinates, because 
in terms of them it is particularly simple to formulate both the laws of Euclidean 
geometry and the laws of mechanics, in particular, the aforementioned law of 
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the rectilinear uniform motion ‘of a body not subject to forces. Of course, the 
very concept of an inertial frame permits one to go over to other, especially 
curvilinear, coordinates and if the equations of the transformation do not 
involve time explicitly one can take it that both the initial and the transformed 
coordinates refer to the same inertial frame. 

In pre-relativistic physics the notion of an inertial system was related only 
to the laws of mechanics. Newton’s first law of motion is, in fact, nothing but a 
definition of an inertial frame. 

However, we have seen that the laws of light propagation play a fundamental 
part in the definition of the basic concepts relating to space and time. Therefore 
it proves more correct to relate the notion of an inertial frame not only to the 
laws of mechanics but also to those of light propagation. 

The usual form of Maxwell’s equations refers to some inertial frame. It is 
obvious and has always been assumed, even before Relativity, that at least one 
reference frame exists that is inertial with respect to mechanics and in which 
at the same time Maxwell’s equations are true. The law of propagation of an 
electromagnetic wave front in the form 


(a) — (a) +(q) +(e) |= 6 
c?\ a Ox oy oz 
also refers to this inertial frame. A frame for which (5.01) is valid may be 
called inertial in the electromagnetic sense. A frame that is inertial both in 
the mechanical and in the electromagnetic senses will be simply called inertial. 

Thus, by the definition we have adopted, an inertial frame is characterized 
by the following two properties: 

1. In an inertial frame a body moves uniformly and in a straight line, provided 
no forces act on it. (The inertial property in the usual mechanical sense.) 

2. In an inertial frame the equation of propagation of an electromagnetic 
wave front has the form (5.01). (The inertial property for the field.) 

We have spoken only of the propagation of an electromagnetic wave, thereby 
apparently giving this field preference over other fields. However, the preference 
is only apparent. The maximum speed of propagation of all fields must actually 
be the same, so that equation (5.01) is of universal validity. This question will 
be discussed in detail in the following section. 


6. The Basic Postulates of the Theory of Relativity 

The fundamental postulate of the theory of relativity, also called the Principle 
of Relativity, asserts that phenomena occurring in a closed system are inde- 
pendent of any non-accelerated motion of the system as a whole. 

We shall attempt to make the content of this postulate more precise. 

Assume a closed system of material bodies and a variety of mechanical, 
electromagnetic or any other processes taking place within the system. We shall 
describe the state of the system, including the electromagnetic and other fields 
involved, with reference to some definite frame, or base in the sense of Section 2. 
Consider two frames. Let the first be inertial and let the second move uniformly 
in a straight lme with respect to the first. We define the initial state of the 
system in relation to the first frame, using the word “ initial” in the sense of 

} See L. I. Mandelstam [6]. 
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“relating to the initial instant according to a clock of the frame”. We then 
imagine a copy of our system which has the same initial state with respect to 
the second frame as the original system had with respect to the first, with 
“initial” now, of course, defined with respect to the other frame. We shall 
compare the subsequent course of events in the original system, described in the 
first frame, with the course of events in the copy, as described in the second frame. 

The Principle of Relativity asserts that the two sequences of events will be 
exactly the same (at least insofar as they are determined at all). If a process 
in the original system can be described in terms of certain functions of the space 
and time coordinates of the first frame, the same functions of the space and time 
coordinates of the second frame will describe a process occurring in the copy. 

One can say more descriptively, if less precisely, that the uniform rectilinear 
motion of a material system as a whole has no influence on the course of any 
processes occurring within it. 

Many natural laws governing the course of various physical processes can 
be stated in terms of differential equations, the form of which does not depend 
on the initial state of the system concerned. The initial conditions enter as 
supplementary equations. This is so, for example, for the equations of motion 
in mechanics, and also for the equations of the electromagnetic field and the 
associated equation for the propagation of an electromagnetic wave front. It 
follows then from the Principle of Relativity that the mathematical form of 
such laws must be the same in the initially chosen frame and in any other 
which moves uniformly and in a straight line with respect to it. Remembering 
our definition of an inertial frame (in the mechanical sense and for fields) we 
are led to the following important conclusion : If an inertial frame vs given, any 
other frame that moves uniformly and rectilinearly with respect to ut ts rtself inertial. 
This postulate will serve us as the basis from which to derive the formulae 
linking the space and time coordinates in two frames moving without acceleration 
with respect to one another. 

The Principle of Relativity is confirmed by the whole of our experience of 
the natural world. In the domain of mechanics the principle has long been 
known ; it is the Galilean relativity principle. Hinstein’s achievement was to 
extend it to all phenomena (though in the first place to electromagnetism) and 
to derive from it certain consequences regarding the interrelation of space 
and time. 

The Theory of Relativity can be built up on two postulates, namely, the 
Principle of Relativity and another principle that states that the velocity of 
light is independent of the velocity of its source. We have accepted the latter 
principle from the first, for we have based our development on the law of 
propagation of an electromagnetic wave front and the second principle is an 
immediate consequence of this law. 

It is appropriate to give here a generalized interpretation of the law of wave 
front propagation and to formulate the following general postulate a . 

There exists a maximum speed for the propagation of any kind of action. This 
is numerically equal to the speed of light in free space. _ 

This principle is very significant because the transmission of signals with 
greatest possible speed plays a fundamental part in the definition of concepts 
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concerning space and time. The very notion of a definite frame of reference 
for describing events in space and time depends on the existence of such signals. 
In Section 2 we have considered methods of determining the positions of bodies 
in space that were based on the use of light, or other electromagnetic, signals. 
In principle, however, signals could be transmitted not only by electromagnetic 
waves but by waves of some other nature. One could, for instance, imagine 
signalling with particles having the limiting speed and with the matter-waves 
that correspond to them in quantum mechanics. Also conceivable, though 
unattainable in practice, would be to use gravitational waves, the existence of 
which follows from Einstein’s theory of Gravitation (see Section 90). It is not 
excluded that other fields capable of transmitting signals might be discovered. 
Therefore the question arises: is the notion of a reference frame sufficiently 
general, if its definition is based on the use of electromagnetic signals only ? 
For if more rapid means of signalling existed, or even instantaneous transmission, 
the concept of a reference frame defined in terms of light propagation could not 
aptly reflect the properties of space and time but would, at best, give one 
possible form of description. 

The principle formulated above, by asserting the existence of a general upper 
limit for all kinds of action and signal, endows the speed of light with a universal 
significance, independent of the particular properties of the agency of trans- 
mission and reflecting a certain objective property of space-time. 

This principle has a logical connection with the Principle of Relativity. 
For if there was no single limiting velocity but instead different agents, 
e.g. light and gravitation, propagated in vacuo with different speeds, then the 
Principle of Relativity would necessarily be violated as regards at least one of 
the agents. 

The principle of the universal limiting velocity can be made mathematically 
precise as follows : 

For any kind of wave advancing with limiting velocity and capable of trans- 
mitting signals the equation of front propagation is the same as the equation for 
the front of a light wave. 

Thus the equation 

1 /éw\? 3 

3 (=) (grad w)? = 0 (6.01) 
acquires a general character ; it is more general than Maxwell’s equations from 
which we derived it. As a consequence of the principle of the existence of a 
universal limiting velocity one can assert the following: the differential 
equations describing any field that is capable of transmitting signals must be 
of such a kind that the equation of their characteristics is the same as the 
equation for the characteristics of light waves. 

We shall sce in Chapters V-VII that the presence of a gravitational field 
somewhat alters the appearance of the equation of the characteristics from the 
form (6.01), but in this case one and the same equation still governs the propa- 

¢ The work of A. D. Aleksandrov [2, 3] contains a formulation of the basic postulates of 
tho theory of relativity as.a theory of the “ absolute ” properties of space and time, which is 
close to ours. The ideas of A. D. Fokker [1a] and the name Chronogeometry proposed by 
him for this theory also underline the “absolute” side of spatio-temporal relations. 
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sation of all kinds of wave fronts travelling with limiting velocity, including 
lectromagnetic and gravitational ones. 

In the preceding section we have taken equation (6.01) together with the fact 
f the uniform rectilinear motion of a body not subject to forces, as the basis 
or defining inertial reference frames. 

The principle just formulated asserting the universality of the equation shows 
hat such a definition of inertial systems is appropriate. 


’. The Galileo Transformations and the Need to Generalize Them 


Let one and the same phenomenon be described in two inertial frames of 
reference. The question arises of passing from the first frame to the second in 
she description of some phenomenon. For a rough illustration we can imagine 
;wo radar stations, one on the Eartli, the other on an aeroplane or on a Sputnik; 
she problem is to convert from the data recorded at the first station to those 
‘ound at the second. 

For such a conversion one must know first of all the relation between the 
space and time coordinates z, y, z and ¢ in the first frame and the corresponding 
x’, y’, z’ and ¢’ in the second. In pre-relativity physics one accepted as self- 
evident the existence of a universal time #t, the same for all frames ; on that 
view one had to putt’ = tort’ =t — tg, if a change of time-origin was admitted. 

Considering two events occurring at times ¢ and +, the old point of view 
required the time elapsed between them to be the same in all reference frames 
so that 


t—t=t—T (7.01) 


Further, it was considered to be evident that the length of a rigid rod, measured 
in the two frames, would have the same value. (Instead of the length of a rigid 
rod one could equally well consider the distance between the 7 simultaneous ” 
positions of two points which need not necessarily be rigidly connected.) 
Denoting the spatial coordinates of the two ends of the rod (or the bee points) 
by (2, y, 2) and (E, », 7) in the one frame and by (2’, y’, 2’) and (&’, 7’, ¢’) in the 
other the old theory required 

— £2 — y)2 —_ 7/2 — (2’ — €’)2 Pah SCY 
(cg —E)2 +y—nP +2 —CP=(@ —EYP + —n)? + ( ia 


Equations (7.01) and (7.02) determine uniquely the general form of the trans- 
formation connecting z, y, z and ¢ with 2’, y’, 2 and ue It consists of a ae 
in origin of spatial coordinates and of time, of a rotation of the spatial axes an 
of a transformation such as 
g =a—Velt 
eae aay (7.03) 
2’ == Fae 
1 
where Vz, Vyand V; are constants whose physical meaning is easy to sce: they 
give the velocity with which the primed frame moves relatively to the unprimed 
one; morc exactly they are the components of this velocity in the unprimed frame. 


20 The Theory of Space Time and Gravitation 


The transformation (7.03) is called a Galileo transformation. 

Thus, pre-relativistic physics asserted that, given an inertial frame (2, ¥, 2), 
space and time coordinates in any other frame moving uniformly and recti- 
linearly relative to the former are connected with x, y, z and t by a Galileo 
transformation, apart from a rotation and a displacement of origin. 

Galileo transformations satisfy the Principle of Relativity as far as the laws of 
(Newtonian) mechanics are concerned, but not in relation to the propagation of 
light. Indeed the wave front equation changes its appearance when subjected to 
a Galileo transformation. If Galileo transformations were valid—and the Prin- 
ciple of Relativity in its generalized form therefore not—there would exist 
only one inertial system in the sense of our definition and the changed form of 
the wave front equation in any other frame would allow one to detect even 
uniform rectilinear motion relative to the single inertial system—the “‘ immobile 
ether ’’—and to determine the velocity of this motion. The fact that numerous 
accurate experiments devised to discover such motion relative to the “ ether ”’ 
all had negative results leaves no room whatever to doubt that the form of 
the law of wave front propagation is the same in all non-accelerated frames. 
Therefore the Principle of Relativity is certainly also applicable to electro: 
magnetic phenomena. 

It follows also that the Galileo transformation is in general wrong and should 
be replaced by another. 

The general postulates formulated in the preceding sections provide all that 
is required to derive a transformation of space and time coordinates to super- 
sede the Galileo transformation. 

The problem can be stated as follows. Let a reference frame be given which 
is inertial by our definition, i.e. in both the mechanical an] the electromagnetic 
senses (see Section 5). We denote the space and time coordinates in it by 2, y, 
z and t. Let another frame (2’, y’, 2’, t’) be given which is also inertial. The 
connection between (x’, y’, 2’, ’) and (a, y, z, t) is to be found. 

The problem of finding a transformation between two inertial frames is 
purely mathematical ; it can be solved without any further physical assumptions, 
other than the definition of an inertial frame given in Section 5. 

We shall obtain the solution in two stages: first we shall prove that the 
required transformation is linear and then we shall determine its ecoefticients. 

From the linearity of the transformation it follows that the motion of the 
new inertial frame relative to the old is uniform and rectilinear. This conclusion 
is essentially merely another formulation of the principle of relativity, according 
to which any frame of reference moving uniformly in a straight line with respect 
to some given inertial frame is itself inertial. Thus the principle of relativity 
is already contained in the premisses of the theorem to be proven. 


8. Proof of the Linearity of the Transformation Linking Two Inertial 
Frames 
It follows from the first condition characterizing inertial frames (the 
mechanical condition), that the property of a motion to be uniform and recti- 


linear must be preserved in going from one inertial frame to another. This 
means that the equations 


L= Lot et— lo); yyy trl —lo)s  z2=2%9 + v(t — ty) (8.01) 
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1ust lead to transformed equations 
a=a+v(t'—4); y=yit v(t — t); z= 2, + v(t’ — 4) 
: (8.02) 
‘he second condition for inertial frames demands that from the equation 


6) -[) Gy yl-2 oe 


in the unprimed frame there should follow, in the primed frame : 


als) - ==) +(s5) +(3) [=o ee 
or) lee) lag 


Now, we can wmite this second condition in a form analogous to the first ; 
consequences of (8.03) are the ray equations (4.08) which express the fact that 
light propagates in straight lines and with constant speed. This must be true 
in both the unprimed and the primed frames. Therefore, the point of inter- 
section of a light ray with the wave surface must satisfy equations 
& = Zq + vz(t — to); Y = Yo + vylt — to); Z = 2%, + Vet — to) 
(8.05) 


where ve vy +ve=c¢? 
in the unprimed frame and likewise 
e=atu('—) y=ytvl—h), ~=% + vit — bo) 
(8.06) 
where ve =e? or = cP 


in the primed one. Thus the second condition amounts to an additional require- 
ment that, if in the equations for uniform rectilinear motion v® = c’, then 
also v’? = c?. 

Our task amounts to finding the most general functions 


P= ft eet) 
y = fi(a, Y, 2, t) (8.07) 
2 Sf bys ee 0) 


t' = fi,(z, y, 2, O) 
such that the primed equations given above follow from the analogous 
unprimed ones.t 
Before tackling the problem we shall pass to a new, more symmetrical 
notation. We put 
rene Y = Le, 2 2 (8.08) 
and introduce instead of time a quantity proportional to it 
toad (8.09) 
which has the physical meaning of the distance that light would travel in the 
given time. (Note that in the new notation z» no longer denotes the initial 
value of a space coordinate.) We write the required transformation as 


x; = filo, Ty, %,03)  (¢ = 0,1, 2,3) (8.10) 


{ This problem has also been studied by Umov [4] and by Wey] [7]. 
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We shall denote the initial values of x», 2, 2, and 23 by Eo, Es, &, and Ea with 
a corresponding notation in the primed system. We introduce an arbitrary 
positive constant 7, and a parameter s given by s = (¢/Yo)(t — to) and put 


er Ae (eee) (8.11) 
Cc 


Then the equations of uniform straight line motion read as 
a=titys  (¢=0,1,2,3) (8.12) 


and our first condition requires that from (8.12) there should follow 
y= Ei tys' (¢ =0,1, 2, 3) (8.13) 
where the & and the ; are new constants and s’ a new parameter which can 
be expressed in terms of the corresponding unprimed quantities as soon as 
the form of the functions f; is known. 
The second condition amounts to the requirement that from 
B—Ot ++) = 0 a 
there should follow 
Yo" —(yi? + v2? + 3?) = 0 (8.15) 
The transformation equations (8.10) must obviously be soluble for the old 


variables 25, 2%, Z and xz, so that the Jacobian of the transformation must 
not vanish 


Det 2 #0 (8.16) 
OxE 


We now proceed to derive the equations for the functions f;. 
Considering the 2; as linear functions of the*parameter s given by (8.12) 
we have 


3 
Bi Ot (8.17) 
ds ork 
k=0 
The derivative 
sal (8.18) 
dxy Yo 
must be a constant by our conditions, so we have 
3 
Dye(Pfilerx) . 
ie = -! = const (8.19) 


D1l@folon) 


We equate to zero the logarithmic derivative of this expression with respect 
to s and obtain 


3 3 
> yEyue?fi/Oxx G21) > yeyi(O7fo/ xx 021) 
k.l=0 


pa , = > (8.20) 
>. ye fi /@xx) > YH Ofo/Orx) 
k=0 k=0 


In this equation the arguments on which the partial derivatives of f, and fi 
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depend are the quantities (8.12) which in turn depend on the y;. However, 
since the &; in (8.12) are arbitrary we can consider the yi and the a; to be inde- 
pendent variables. (We can reach the same conclusion by putting s = 0 in 
(8.20) and then writing a; instead of £;.) Therefore the equations (8.20) must be 
satisfied identically in the vi and the xj. 

As functions of the ; the expressions (8.20) are rational fractions, but it is 
impossible for all four denominators, i.e. the four expressions (8.17) for 7 = 0, 1, 
2, 3, to become zero simultaneously for any values of the +;, because the determi- 
nant (8.16) is always different from zero. Therefore each of the fractions always 
remains finite, even if its denominator becomes zero. This can only be so if the 
denominator is a factor of the numerator and, therefore, the expressions (8.20) 
are not, in fact, fractions but rational integral functions of the y; which we can 
write in the form 


3 
2 a (8.21) 
o°fi Cie 
27 8.22 
Thus we have > VEYA a, yn hoe ae, 2 yi (8.22) 
As this is an identity in the y:, we have 
2 
AIOE egy ce (8.23) 
OL, OXI "Om OLE 


The conclusion is the following : in order that one uniform straight lme motion 
should transform into another the transformation functions f; must necessarily 
satisfy the system of partial differential equations (8.23) where the qx are 
certain functions of 25, 2, Z, and 73. 

We now find the condition that must be satisfied by the functions f; in order 
that (8.15) should follow from (8.14). Owing to (8.19), equation (8.15) is 


equivalent to 
3 af 2 3 3 oft 2 
Oy 0 24 
(> nz) >, (> ng) ie) 


This must be a consequence of 
vo — (yi + v2 + 3) = 0 (8.14) 
As the left-hand sides of both (8.24) and (8.14) are quadratic functions of the 


yi they must be proportional to each other. To state the consequences of this 
conveniently we introduce the four quantities. 


Oya 13 eS tga Sl (8.25) 
and write (8.24) and (8.14) in the form 
3 Of afi 
pipgty te ote) 8.26 
CRY Ban (8.26) 
1k, T=0 
3 
>» & Ye = (8.27) 
k=0 


Taking the left-hand sides of (8.26) and (8.27) to be proportional we get 
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(8.28) 


tM 
g& 
| 
Is 
| 
~ 
AS 
=” 


where Ais some function of x», 21, 2, and z, and the symbol 54: has its familiar 
meaning 


8x1 = 0 for AAL; Sex = 1 (8.29) 
We differentiate (8.28) with respect to 2m, put 
Gs, (8.30) 
OL, 


and have then 


Of, Of af, Of 
Doge ee ae i #) = LhomenSut (8.31) 
Inserting herein the expressions (8.23) for the second derivatives and using 
(8.28), we obtain 
erbadim + exbidem + 24exbmder = 2enpmdet (8.32) 
This relation must hold for all values of k, 1, m. Putting k =1, m 41, this 
gives 
i= Om (8.33) 
Then, putting k = m, 1 = m, we get Ym — 0, therefore 
Vm = Om = 0. (8.34) 


The quantity A in (8.28) and (8.30) must then be a constant and the equations 
(8.23) for the f; become 


Of; 
wage 0 (8.35) 
The final result is that the combination of the two conditions characterizing 
inertial frames requires the transformation formulae between the coordinates 
in the two frames to be linear. 
The question as to how much follows as a consequence of each of the two 
conditions (i.e. of (8.23) and (8.28)), separately, is considered in Appendix A. 


9. Determination of the Coefficients of the Linear Transformations 
and of a Scale Factor 
Equations (8.28) for the function f; involve a constant multiplier A. In order 
to bring. out explicitly its influence on the transformation we shall write the 
linear functions f; in the form 


3 
a= fi= Vae TD extinr) (9.01) 
k=0 


Equations (8.28) will be satisfied identically in 4 if the coefficients ay, satisfy 
the relation 


3 
>, C1ekQir = €xDK1 (9.02) 


i=0 
from which it also follows that 
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3 
D, Cred, = exdx1 (9.03) 


1=0 


The equations (9.01) can be solved for the a4, leading to 


0 > evan( =H = a) (9.04) 


k=0 
We also have 


dary? — (day? + das? + dal?) = Ndag — (da? + dak + da2)] (9.05) 
and, for an arbitrary function w: 


()’ (33) (=): Oey \ 2 
ao} > Maal ayaa a=] 
OX, OX, OX (=) 
=5((S)' (se) =) Ow \? 

ia) ~ Uae) +(e) +(e) ]} 8 
The factor A, or rather 1/A, evidently characterizes the ratio of the scales of 
measurement in the primed and unprimed frames; in either frame the scale 
is a common one for space and time coordinates so that a change in it has no 
effect on the values of either angles in space or of velocities. We shall show that 
‘this factor should be equated to unity. 

Consider a point at rest in the primed frame. In the unprimed frame the 
velocity of this point is then the velocity of the primed frame itself. We denote 
the components of this velocity (in the unprimed system) by V,, V, and V3, 
so that 

day dx; , , , 
Ve= Eo for dz, = dx, = dx, = 0 (9.07) 
But, from (9.04) we have 
iL 


f * 1 td 
dx, = Th Ay ,AX,; dx, = Ta Ay AX, (9.08) 
and, therefore, Vi=c-% (i =1,2,3) (9.09) 
Boo 


It follows in the first place that the transformation (9.01) which gives the rela- 

_ tion between the four coordinates in the two inertial frames, corresponds to 
passing from a given frame to another moving uniformly and in a straight line 
relative to the first, as it should be. 

Further, it follows from (9.09) that the relative velocity of the motion is not 
at all connected with the scale factor A and, therefore, that this factor cannot 
depend on the relative velocity.t| Then one has only to assume that for 
vanishing relative velocity, length and time measurements in both systems are 
performed in the same way, in terms of the same units, and one finds 4 = 1. 
Subject to such very natural requirements A will be equal to 1 for any mertial 
frame, whatever its velocity. 

+ It is usually said, following Einstein, that the scale factor can “ evidently ” depend on 
nothing but the relative velocity, and it is subsequently proved that, in fact, it does not have 
any such dependence but is equal to l. 
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Putting 4 = 1 reduces our formulae (9.05) and (9.06) to the simpler form 
da? — (da? + da? + day) = dai — (dat + daz + dag) (9.10) 


Ow \ 2 Ow \ 2 Ow \ 2 ea ) i 
oad (5) ~ [le + Cae) + (Fe 
Ow \ 2 Ow \ 2 Ow \ 2 Ow \2 
Pep, apa = — 9.11 
& (=) or (=) zi (=) | ie 
while the transformation equations from either frame to the other become: 
3 


Li = 4a; + > Cy QipLy (9.12) 
k=0 
3 
and i > Cy Ont (x; = a) (9.13) 
k=0 


These relations carry the name of Lorentz transformations. They constitute 
the formal basis of the whole theory of relativity. 

One should note that the starting point of our discussion was the requirement 
that in every inertial frame the equation describing the propagation of an 
electromagnetic wave front should have the form (5.01). Thence it followed 
that the vanishing of the right-hand side of (9.11) led to the vanishing of the 
left-hand side. But by imposing an additional condition to the effect that a 
motion uniform and rectilinear in one frame should remain so in the other and 
that scales of measurement should not alter we found more: the left- and 
right-hand sides of (9.11) not only vanish together, they are identically equal. 


10. Lorentz Transformations 


A Lorentz transformation is a set of equations for transforming the space 
and time coordinates in one inertial frame into those of another that moves 
uniformly and in a straight line relative to the first. The transformation can 
be characterized by the fact that the quantity 


ds? = dx? — (dx? + dx? + dz?) (10.01) 
or ds? = c? dt? — (dx? + dy? + dz?) (10.02) 
remains invariant in the strict sense (not only the numerical value, but also 


the mathematical form of the expression remain unchanged.) The most general 
Lorentz transformation is of the form 
t= a; + > e042, (10.03) 
k=0 
where the coefficients ai, satisfy the relations 
3 


2 CADiKAiL = RSE] (10.04) 
=0 
3 
and >, Axiau = exdx1 (10.05) 
i=0 


Let us investigate the physical meaning of the constants entering these trans- 
formation formulae. 


First of all, the constant terms a, evidently represent a change of origin for 


space and time coordinates. If we take it that at t — 0 the spatial origins of 
both frames coincide and t’ = 0, we have 


“u=0 (i=0,1,2,3) (10.06) 
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In the following we shall make this assumption and shall write Lorentz trans- 


formations in the form 
3 


ay = > epinay (10.07) 
k=0 
We have already discussed the physical significance of the coefficient ratios 
1 
2b ay (10.08) 


They give the relative velocity between the two systems, divided by the speed 
of light. More precisely, the V; are the components in the unprimed frame of the 
vector giving the velocity of the primed frame relative to the unprimed one. 
Since the transformation inverse to (10.07) has the form 

3 


x, = > ed, ,2, (10.09) 
k=0 
the quantities V; given by 
; 1 
ey (10.10) 
Gig € 


are the components in the primed frame of the velocity vector of the unprimed 
frame relative to the primed one. 
If in (10.04) and (10.05) we put & = 1 = 0, we obtain 


Bio — (Gio + 39 + G30) = 1 (10.11) 

and aby — (a3, + a2 + as) = 1 (10.12) 
Whence, using (10.08) and (10.10), it follows that 

V2 + V3 + “V3 = Vi? + VA? + ‘V3? (10.13) 

and Vr)? (10.14) 


Here V2 is understood to be the square of the magnitude of the velocity. 
Therefore the relative speed measured in either frame is the same—a result that 
is not obvious although it corresponds entirely with intuitive ideas. 

It follows from (10.08) and (10.12) that 


1 


= SS ast 10.15 
oe = Vd = Ve) oon 
' The square root must be taken positive, for 
at’ 
Oe = = > 0 (10.16) 
A negative value of ao, would imply a change in the direction of time. 
From (10.08) and (10.15) we get 

Ay, = ome (v = 1, 2, 3) (10.17) 

8 ee — V9) 

and, similarly, from (10.10) and (10.15) 
i (i = 1, 2, 3) (10.18) 


ayy = V(e2 — V2) 
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Taking one suffix to be zero in the orthogonality relations (10.04) and (10.05) 
and using (10.17) and (10.18) one obtains 


Ay Vi = AV, + ayV, + a3V, (2 = 1, 2, 3) (10.19) 
and Oy V, = OV, + oyVi tay, (6 =1,2,3) (10.20) 
If both suffixes are different from zero the orthogonality relations give 
ViVi 
By yAyy, +H AypBo, TP Aggy = Oy, 1 eye (10.21) 


For 71, k = 1, 2, 3 we shall now introduce new quantities «, instead of the 
Qik by 


40% ox 
eee 10.22 
ik 4} area (10.22) 
Viv 
or ak = — dk + (49 — 1) = (10.23) 


It is easy to verify that as a result of (10.20) and (10.21) we have 

Oy Hae + Mion + Mgt, = Six (2, & = 1, 2, 3) (10.24) 
Thus the am are the coefficients of a three-dimensional orthogonal trans- 
formation and can be interpreted as direction cosines between the two sets of 


spatial axes with the first suffix referring to the new axes, the second to the old. 
Using (10.19) and (10.20) equations (10.22) give 


mV t+ aV_ + HV, = — V, (10.25) 
and ayVi + %,V, + a5,V, = — V, (10.26) 


These relations can be interpreted as saying that the vector V’ (the velocity 
of the unprimed frame relative to the primed one) and the vector V (the velocity 
of the primed frame relative to the unprimed) are equal in magnitude and 
opposite in direction. 

The equations obtained allow us to express all the coefficients of a Lorentz 
transformation in terms of the direction cosines ai (7, & = 1, 2, 3) and the 
three components V,, V, and V; of the relative velocity. As the nine a are 
subject to the six relations (10.24) and hence are expressible in terms of three 
independent quantities, the Lorentz transformation involves six parameters 
in all, not counting the constant displacement terms a; that were set equal 
to zero. 

We have already obtained the expressions (10.15) and (10.17) for the coeffi- 
cients @) and dy; For the aj we can take the expressions (10.18) in which 
the V; must be understood as expressed in the form (10.25), viz. : 


1 3 


a ae 
ip Fea vs 2 “iV. (i= 1,2,3) (10.27) 
For the azz, finally, we can put 
ViVe 
dik = — Wk + (oo — 1) “yz (10.28) 


or, more explicitly : 
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= 1 Vix 
Qik = — Ok -( ara aa 1 re > aVy (10.29) 
Let us insert the values of transformation coefficients Just found into (10.07), 
writing ct for x, to get back to more descriptive notation. We then get 


j 1 1 
= /(1 — Vc) ( oe (Via, + Vex, + Va)) (10.30) 


The expressions for the spatial coordinates are most conveniently written as 


b> ee (10.31) 
m 
where 


l Vouk 
2%, = Lm — Vat + (Gite _ 1) Tf 2 Vilax —_— V xt) (10.32) 
k=1 


We see that the Lorentz transformation can be performed in two stages. The 
first consists in going from variables (2x,, x», Zq, t) to new variables (the) 
where x¥, x¥ and x¥ are given by (10.32) and ¢* is equal to ¢’ and given by 


1 ‘ ] ‘ 
ie — V(1 = V2/c2) (: = ne (V2, ae Ver, + Vat) (10.33) 
The second stage consists in going from (x7, x3, x3, t*) to (a1, 24, 24, t’) where 
by (10.30) and (10.31) 


t= 6,0 + a,7* +a,0%; t =o (10.34) 
The second stage is evidently simply a rotation of the spatial reference frame, 
while the first is a transition to a frame moving with the velocity (V,, V., V3) 
without any rotation of spatial axes. 

Of course we could also have performed the rotation of axes first and then 
gone over to the moving frame. 

The reverse transformation can also be performed in two stages, firstly going 


from (x1, 23, 24, t’) to (x¥, x3, xf, (*) by the equations 

OF = a0, toy t, + yt, t= 0 (10.35) 
which represent simply a rotation of axes, and secondly passing from 
(ax, of, rx, (*) to (2, Le, 2, t) by means of 
1 


Vege So : 
Vl — V¥e%) ‘) ve Zz Vlog + Vit%) (10.36) 


Ly, =, Lin aP Vi a ( 


and 


1 1 
| en (5 = (Vat + Vast + Ved) 10.37) 
a Hs Teg smal 22 303 ) ( 

which are the inverse formulae to (10.32), (10.33). The transformation from 
(a*, xx, ax, t*) to (a), Ze, Lg, t) and its inverse go over into each other if the sign 

of the velocity V is reversed. . 
One should note that the Jacobian of these transformations is unity. As 
regards the substitution (10.34) and its inverse (10.35), the Jacobian will be 
+1 if they describe a rotation of axes in the proper sense, i.e. without an 
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accompanying passage from a right-handed frame to a left-handed one, or 
vice versa. Therefore, if the direction of time is preserved and also the right- 
handed (or left-handed) character of the spatial frame, the Jacobian of the 
Lorentz transformation will be equal to 1. Such Lorentz transformations are 
customarily called proper. In the following we shall only use proper Lorentz 
transformations. 

A rotation of axes is not properly a transition to a new inertial frame, so it 
is of no interest to us. The characteristic properties of a Lorentz transformation 
are contained in equations (10.32) and (10.33) and their inverses (10.36) and 
(10.37). These equations simplify if the directions of the axes are so chosen 
that one of them, e.g. the first, is in the direction of relative velocity. Putting 


V,=V; V,=V,=0 (10.38) 
we get from (10.32) and (10.33) 
x, — Vt \ 
ay = van V2/e2)’ Ly = Le; ae = Ly (10.39) 
2 Saale (10.40) 
~ Vl — VI : 


The inverse equations are obtained by replacing Y by —V. For greater 
descriptiveness we shall write x, y, 2 instead of z,, 2,, 23. Replacing the asterisk 
by a prime, we then get 
; e—Vt | : ; ’ 
y= Va — Pee y =Yy; z= 2 (10.41) 
t — Vac? 
V(1— V%c?) 
and, expressing z, t in terms of 2’, t’: 
w+ Vt : 5 
7 = TT Van y=y'; 2=2 (10.43) 
_ U4 V2'/c? 
~ a = VCH 
The transformation given by these formulae is of a special kind, but it contains 
all the characteristic features by which the new theory differs from the old. 
It is convenient to write the formulae for Lorentz transformations in vectorial 
form. Using familiar three-dimensional vector notation we get 


and i = (10.42) 


and t (10.44) 


fe al ly. n| (10.45) 
. JQ — V?/c?) ce 
1 Vv: 


11. Determination of Distances and Synchronization of Clocks within 
One Inertial Reference Frame 


Before going on to discuss consequences of the Lorentz transformation, let 
us return to the questions already raised in Sections 1 and 2 concerning the 
measurernent of distances and time differences in a single reference frame. 
We shall dwell a little more on the concept of simultaneity at different points in 
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space and on the problem of synchronizing clocks at a distance from each other. 

There are different methods of measuring distances between bodies at rest 
in an inertial frame: one can directly apply a graduated rule or “ measuring 
rod’, one can use triangulation, or else one can use radar. The first method 
relies only on the properties of rigid bodies, the second depends also on the 
rectilinear propagation of light, in the third the knowledge of the speed of light 
plays an essential part. It goes without saying that all methods assume the 
validity of Kuclidean geometry ; if it were not valid then, for instance, different 
triangulation procedures could give mutually contradictory results. We have 
stressed in Section 2 that the validity of the laws of Euclidean geometry should 
be taken as an experimental fact and not as an a priori assumption. 

All three methods of measurement are based on the properties of solid bodies 
and of light. Since light is the simpler phenomenon one should consider it as 
the primary entity and should, for instance, check the constancy of a standard 
of length by optical means—by finding the number of wavelengths that fit 
on to the standard. 

To measure the speed of light one must know how to measure distances and 
time intervals by methods independent of the knowledge of that speed. If, 
however, one assumes that the speed of light is the same in some direction and 
its reverse, it is enough to be able to measure time at one point. Any periodic 
process, in principle, gives one this ability, for instance, the vibration of an 
ammonia molecule or of a quartz crystal. We shall call a device that measures 
time a clock or chronometer, regardless whether it is in fact a mechanical 
contrivance, such as a common clock, or whether it works on some other principle. 

The measurement of the speed of light basically amounts to the determination 
of the interval +t during which light passes to and fro along a path of length 7, 
previously measured by triangulation or with a measuring rod ; the speed of 
light will be ¢ = 2r/t. Thus the problem amounts essentially to finding a 
conversion factor from a distance expressed in units of length and measured by 

‘triangulation or by direct application of a measuring rod, to a distance expressed 
in units of time and measured by radar methods. This conversion factor can 
be determined once and for all. 

We now pass to the question of comparing the readings of clocks at rest in 
the same inertial frame, but situated at a distance from each other. We assume 
that the clocks are of identical construction and run at the same rate, so that 

_ the question is simply that of synchronizing them. Let “clock A” be at the 
point A and “clock B” be at the point B, a given distance 7 from A. A light 
signal is emitted from A at time ¢, ; it is reflected by a mirror placed at B and 
returns to A at time t,. What time should be indicated by the properly syn- 
chronized clock at B at the moment the signal reaches that point ? We shall 
assume that clock B is synchronous with clock A if at the moment the signal 
arrives it shows the time ¢’ = (t; +¢,)/2. This is Hinstein’s definition of 
synchronization for two spatially separated clocks. It is a very natural defi- 
nition and is in accord with the basic postulate of the Theory of Relativity 
that the speed of light is constant. For, if the signal leaves A at time ¢, it must 


reach B at the time . 
t = ty + Fs 
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having traversed the distance r with speed c. On the other hand, as it returns 
to A at time t,, having previously traversed the distance 7 from B to A, it 
must have been reflected from B at the earlier time, 

t 


Lai,=2 
c 


It follows from these two equations that we must attribute to the time of 
reflection of the signal at B as measured by clock A the value ¢ = (t, + t,)/2, 
and if clock B is to be synchronous with clock A it should show the time 
t’ =t at the instant of reflection. 

From the fact that the speed of light is the same going to and fro it follows 
that the above defined property of being synchronous is a reciprocal one of the 
two clocks: if clock B is synchronous with clock A, then clock A is also 
synchronous with clock B. Further, since Euclidean geometry is valid and the 
speed of light is the same for any direction of travel (both these facts are implied 
in the equation for the propagation of the front of a light wave), it follows that, 
if clock A is synchronous with several clocks B, C, D--, all these clocks are 
mutually synchronous. 

This last fact makes it possible to envisage as a model of an inertial reference 
frame a rigid scaffolding having at each of its junctions a clock, with all clocks 
synchronous. Using such a model the motion of any body relative to the inertial 
frame can be described as follows: let the body move in turn past the clocks 
that have spatial coordinates (x1, y,, 21), (Va, Ya, 22), ete., with the clock at (a4, 
Yi, 24) Showing time ¢; at the instant of the body’s passage. Then the coordinates 
z(t), y(t) and z(t) of the body will be such functions of time as to have the values 
(xi, yi, 21) at the times fy. 

For all its unwieldiness such a model can be of use and is often discussed in 
treatments of the Theory of Relativity. However, we prefer the model of a 
radar station because it allows a continuous determination both of the position 
of a moving body and of the corresponding time, by the station clock—a deter- 
mination based on a single point, the position of the station. Because the speed 
of light is independent of the velocity of the source both models give the same 
values of space and time coordinates, the difference between them is merely 
that the radar model does not need any advance determination of distance, or 
synchronization of clocks, but instead, so to speak, performs these functions as 
the motion proceeds. 

The radar model is the more flexible one and retains its intuitive value even 
in cases when the rigid scaffolding is quite inappropriate, for instance, when 
discussing astronomical distances. In addition, the radar model is one that 
permits of generalization. 

The above exposition of Einstein’s method of synchronization by light signals, 
with due account of retardation, appears so natural that one might think at 
first sight that it contains nothing characteristic of the Theory of Relativity. 
However, this is not so. The method indicated contains a definition of simul- 
taneity at different points in space from the standpoint of a given inertial 
system. This definition rests on the laws of the Theory of Relativity and is 
not arbitrary, but, on the contrary, is the only rational one from the point of 
view of this theory. 
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In pre-relativistic physics it was assumed as self-evident that a ringle universal 
time exists and therefore it was tacitly supposed that there is no need to define 
simultaneity at different points in space. Consequently it was assumed that 
any method of synchronization, for instance by transport of clocks instead of 
by light signals, would give the same result. In fact this is not so. 

We shall show later that the’ Theory of Relativity predicts that if clock A 
is synchronized with clock B by light signals and a chronometer C is checked 
at A with the clock there and then transported to point B, its reading when 
at B will not agree with that of clock B, even assuming ideal working of the 
chronometer. The readings of C will depend on the speed of its transport and 
will coincide only in the limit that this speed is infinitely small. 

We shal] return to this question in the next section. Here we wished to stress 
that even such a simple physical concept as simultaneity in a single inertial 
frame requires precise definition, and that all methods employed to measure 
the corresponding physical quantity must be in accord with this definition. 


12. Time Sequence of Events in Different Reference Frames 

The Lorentz transformations contain the formal basis of the views on space 
and time that follow from the Theory of Relativity. 

Using these transformations we now consider the question of the time 
sequence of events in different reference frames. By ‘“‘ event”? we mean an 
instantaneous occurrence that can be characterized by a point in space and a 
corresponding moment of time. 

To have a concrete picture, we shall imagine that the “ events ” consist of 
the instantaneous flashing of light signals. Let the first flash happen at time 
t, at a point with coordinates 2, y, and z, and the second at time é, at the 
point (%, Y2, Z2). Introducing ordinary three-dimensional vector notation we 
can characterize the place and time of the first event by the symbol (r, 4) 
and thosc of the second by (Fg, é.). 

We first ask: which of the two flashes occurs first ? The answer will be 
unambiguous if the light from one of the flashes has time to arrive at the place 
of the other before the latter occurs. If 


1 
bo b> |r. --r, | (12.01) 


the-first flash is incontestably earlier than the second, and of course if 
l : 
lg ty) <- F ir, - £,| (12.02) 


the flash (r,, /,) certainly takes place later than (To, ¢,). 

We shall talk of pairs of events for which cither (12.01) or (12.02) is true as 
being in time sequence. In the case (12.01) we shall say that the second hap- 
pening is absolutely later than the first and that, if (12.02) holds, it is absolutely 
earlier. In both cases one has 

eX(t, — t,)? — (tr, —1,)? > 0 (12.03) 
The real positive quantity 
l : E 
T =~ V(o%X(ty — 4)? — 2 -- 41)" (12.04) 
c 
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is called the tome-like interval between the two events. The interval between 
two events in time sequence is independent of the reference frame because the 
expression under the root in (12.04) is invariant under Lorentz transformations. 

By the condition ét’/ét > 0, which expresses the conservation of the direction 
of time (see (10.16)), the relations (12.01) and (12.02) are also independent of 
the frame. This must be so because the question whether a signal has or has 
not arrived at the position of the other event is a physical one and cannot 
depend on any reference frame. Thus the notions “ absolutely earlier ” or 
“ absolutely later’, applied to events in time sequence, are invariant. 

Let us assume now that the light from one flash does not reach the place of 
the other flash before the latter occurs. 

Then inequalities opposite to (12.01) and (12.02) will hold : 


1 1 
~s[te—n[<a—-4<c|rm—r| (12.05) 


Pairs of events for which the inequality (12.05) is true will be called quasv- 
simultaneous. This name is justified by the fact that in this case the notions 
“earlier” and “later”? become relative ones: one may find t, —¢, >0 in 
one reference frame and ¢, — 4; <0 in another. The question as to which 
flash happened first has now no unique answer. 

Quasi-simultaneous events can be characterized by the invariant inequality 

C%(ly — ty)? — (rp —1,)? <0 (12.06) 
which follows from (12.05). The two relations, (12.05) and (12.06), are equi- 
valent and, therefore, (12.05) is also invariant. We shall call the real positive 
quantity 

R= V[(re — 41)? — e(t2 — &)*] (12.07) 
the space-like interval between two quasi-simultaneous events. 

We shall. now show that if two events are quasi-simultaneous it is always’ 
possible to choose a frame of reference such that the events are simultaneous 
in it, and that if two events are in time sequence a frame can be found in which 
they occur at the same point in space. 

We consider two quasi-simultaneous events with a given value for the time 
difference ¢, — ¢t,, which must, of course, satisfy (12.05). We introduce a new, 
primed, reference frame moving with speed V relative to the first frame. 
According to the transformation of time given by (10.30), the value of the 
difference t’ — t; in the new fraine will be 


1 - 1 
V(1 — V*/c*) ‘ 2k 
The relative velocity V can be so chosen as to make the two events simul- 
taneous in the new frame. To do this, we merely-have to put 


E—-h= 


(tr, —r,) | (12.08) 


V = (r,.—r,) re aa (12.09) 
By virtue of (12.06) we have 
c4(t, — ¢,)? 
Vve=—* 9 (12.10) 


| [fo P 


so that the required relative speed is less than the speed of light, as is required 
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for a Lorentz transformation. 

To calculate the difference Of the spatial coordinates of the two quasi-simul- 
taneous events in the system in which they are actually simultaneous we write 
the equation (10.32) for the Lorentz transformation in the vectorial form : 


1 V 
r =r V+ (ae - 1) Rewer — 19 (12.11) 


Inserting the difference t, — t, for ¢, the difference r, — r, for r and expression 
(12.09) for V we obtain for r’ = r, — rj the expression 


r, —T, = (rf. —1}) Jb — ea = (r, —r,) Jt — =) (12.12) 


In the primed system the spatial distance is thus seen to be 


ei (12.13) 
where FR is the space-like interval defined in (12.07). Equation (12.13) also 
follows directly from the invariance of the expression (12.07) for R and the 
condition ¢, — t, = 0. 

Thus the physical significance of the space-like interval between two quasi- 
simultaneous events is that it is their spatial distance in the frame in which 
they are simultaneous. 

Let us now consider two events in time sequence ; for these either (12.01) 
or (12.02) must hold. We shall show that here a new, primed, frame can be 
introduced in which the spatial coordinates of both events coincide. To see 
this it is sufficient to introduce the difference ¢, — ¢, instead of t, and r, — r; 
instead of r into (12.11) and to put 


, , 
|r Tr 


(12.14) 


Then the vector r’ = r, — r, vanishes, hence 

i= 7; (12.15) 
Because of the inequality (12.03) the magnitude of the velocity V is less than 
the speed of light 

V2<@ (12.16) 


as it should be. 
In the new frame the time difference between the two events is, by (12.08): 


g-a=e—a) [1 oet) = (t, — ty) J(i-4) (12.17) 


Hence, if t, — t, > 0, we get 
“%—t =T (12.18) 
where T is the time-like interval (12.04). aon, 
In this way the physical significance of the time-like interval is put in evidence. 
It is the time elapsed between two events in time sequence, measured in that 
frame in which both events occur at the same point in space. This reference 
frame can be very directly visualized. It consists of, say, a clock moving 
uniformly in a straight line from the place of the first event to that of the second 
with such a speed as to be exactly at the first point when the event there occurs 
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and likewise at the second point at the time of its event. 

One should note that the relation of being in time sequence is transitive : 
given that a second event occurs absolutely later than a first and a third abso- 
lutely later than the second, it follows that the third is absolutely later than 
the first. This property is physically obvious, but can also be proved as follows : 
The two inequalities 


1 1 
tp —t >= [heh ; ts —tp >= |ta—Ta| (12.19) 
lead by addition to the third, 
] 
t,; —t, > 7 rs —F1| (12.20) 


since the sum of two sides of a triangle is greater than the third side. On the 
other hand the relation of being quasi-simultaneous is not transitive. If two 
events are quasi-simultaneous and a third is quasi-simultaneous with one of 
them, it may also be quasi-simultaneous with the other, but it could also be in 
time sequence with it, i.e. happen absolutely earlier or absolutely later. 

Let us summarize. In the theory of relativity one divides events according 
to their instants of occurrence into quasi-simultaneous events and events in 
time sequence ; this distinction does not depend on any particular reference 
frame. Two events in time sequence possess an invariant time-like interval, 
equal to their time difference in a certain reference frame. Two quasi-simul- 
taneous events possess an invariant space-like interval equal to their spatial 
distance in a certain frame. 

This division in Relativity of pairs of events into two classes agrees with the 
notion of causality and makes it more precise. Given two quasi-simultaneous 
events neither can be the immediate cause or effect of the other. (Of course, 
they may both have as a common cause some third event that occurs absolutely 
earlier than either.) Only pairs of events in time sequence can be directly causally 
related, the one that occurs absolutely earlier may (but, of course, need not) 
be the cause of the other. 

The concepts we have introduced are natural generalizations of the notions 
of pre-relativistic physics. The older ideas were not in accord with the fact 
that a finite limit exists for the speed of propagation of all kinds of action, the 
finite speed of light. They were not defined precisely enough and therefore gave 
rise to paradoxes. The new scheme of ideas incorporates the finiteness of the 
speed of light and so removes the paradoxes, 

In older physics the idea of simultaneity was introduced a priori, without 
experimental justification. We shall consider one of the paradoxes to which 
the idea led. Assume we have two reference frames in uniform rectilinear 
relative motion and such that at a certain instant their origins coincide. Let a 
flash of light occur at that instant at the common origin. We consider the 
motion of the wave front from the point of view of the two frames. Seen from 
the first frame the wave front at any instant of time is a sphere centred at the 
origin of this frame ; but the same can be asserted with reference to the second 
frame—at any moment the wave front is a sphere centred at the other origin. 
The two origins only coincide at one instant, later they move apart, but a sphere 
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can only have one centre. This discussion uses, on the one hand, the Principle 
of Relativity, i.e. that in two frames in uniform rectilinear relative motion all 
events occur in the same manner, on the other the principle that the speed of 
light is independent of the motion of the source—a consequence of the idea of 
a field. Both these principles are quite incontrovertible. How then is the 
paradox resolved ? 

The resolution is to be found in the fact that the words “at any instant of 
time” have a different significance when applied to one or the other frame. 
The paradox arises only if the phrase is taken to mean “ at any instant of a 
unique absolute time, common to both frames”. The existence of such a 
unique absolute time was taken for granted in pre-relativistic physic ; it is 
denied in Relativity. We have learnt that in Relativity time in one frame is 
not the same as time in another. In each frame the wave front is defined as 
the locus of all points reached by the light disturbance simultaneously with 
respect to the given frame. Since events that are simultaneous in one frame are 
not so in any other, the wave front in the one frame will consist of points different 
from those of the front in the other frame. If we make the picture that the wave 
propagation takes place in some rarified medium like a gas the wave fronts in 
the two systems will cover different sets of particles. Since we have two 
different wave fronts, it is not surprising that they have different centres. 

This resolution of the paradox shows up vividly the logical necessity of 
renouncing the ideas of “ absolute time” and “ absolute simultaneity ”’. 

Half a century ago when the Theory of Relativity was just emerging, this 
renunciation appeared to many as wellnigh unacceptable and it required great 
scientific boldness on the part of Einstein, who created the theory, to convince 
himself of its necessity. Today we accept the renunciation much more readily. The 
everyday notion of simultaneity is covered by the concept of quasi-simultaneity 
while in scientific matters, where we deal with great distances or great speeds, the 
necessity for a more precise definition of time appears quite natural. 


13. Comparison of Time Differences in Moving Reference Frames. 
The Doppler Effect 

Let us assume that a certain frame of reference or base is given, which is 
taken to be at rest. We observe from this base the development of some process 
occurring in a moving system. As such a process we could take the uniform 
running of a clock attached to a moving frame of reference. To follow the 
behaviour of the moving clock one could arrange for it to emit a light signal 
every second, or one might be able simply to observe its dial from a distance. 
In addition the base needs to be equipped with some means, say radar, to 
record continually the distance of the clock from base. We can then observe 
with a clock at the base the times of arrival of the light flashes from the moving 
clock and correct each of these recorded times for the time of travel of the 
signal. In this way we can determine the times ¢, at which the supposedly 
second-by-second flashes were emitted. The correction mentioned is very easy 
to calculate. 

Let r(t) be the distance to the moving clock as measured from the base, 
expressed as a function of base time. Then ¢y is obtained from the directly 
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observed arrival time t* by the equation 


i 4 Me ts (13.01) 


c 


The question is now whether the times t, will be spaced at intervals of just one 
second. In other words, will the calculated times t, coincide with the seconds 
of a clock at the base identical with the moving clock under observation ? 

The equations of the Lorentz transformation answer this question. Assume 
that the clock is moving away from the base along a straight line passing 
through the origin of the base coordinates. (We take it that the signals are 
being received at the origin.) Taking this straight line as the z-axis and 
assuming that the moving clock was at the origin at t = 0, the position of the 
clock is given by the equations 


z= vu; y=0; z=0 (13.02) 

and z is just the distance of the clock from the base. Using equation (13.01) 
we see that the signal received at the base at time t was emitted at the instant 
t* 

ja 

1+ u/c 

reckoned by the base clock. Using the equations of the Lorentz transformation 

we now introduce a reference frame fixed to the moving clock : 


(13.03) 


x — vt t — vafc? 


"ya Se le 
As we know, the equations inverse to these are: 
— pe _ + v8'/e? 
ame ear ME eis 


In the primed system the position of the clock that emits the signals will be 
xv’ = 0, y’ = 0, z’ = 0 and the instants of signal emission are 


{=n (13.06) 


where the constant + depends only on the structure of the clock, not on its 
motion ; it equals one second in the example of the text. Therefore we have 
nT 


On the other hand an identical clock at the base would record the times nt. 

Therefore if the moving clock is observed from the base it would appear to be 

running slow compared to the base clock. The emission times that are observed at 

base will be not t (one second) apart, but less frequent ; the spacing being given by 
v 


At = aaa (13.08) 


One should remember that this expression applies after correcting for the finite 
speed of the signal with the aid of (13.01). The actual instants at which the 
signals are observed to arrive at the base are given by 


fe (1 ae “tn = ne,/ (4) (13.09) 
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so that their difference is A =o yi ( (13.10) 


To make the-discussion simple we have dealt with the case of the clock moving 
directly away from the observer (i.e. from the device registering the light 
signals). We now consider the more general case that the trajectory of the 
clock does not pass through the base. Let the receiving device still be situated 
at the origin, but let the moving clock coordinates now be given by the equations 

x= vt; Y= Yo; Z=Zq (13.11) 
in the base frame. We may still put 

we =0; y=0; 2=0 (13.12) 
in the moving frame. 

As before one passes from one frame to the other by means of the Lorentz 
transformation (13.04) or (13.05), the only difference being a shift y — y’ = yg, 
z— z' = z,along the axes of y and z. Therefore, the connection between t’ and t 
and thus also (13.08), giving the rate of the moving clock, remain unchanged. A 
change appears only in the connection between ¢* and ¢, i.e. between the instant 
of emission and the instant of reception. We shall now have instead of (13.09) 


t+ ~y(o%t + y2 + 22) (13.13) 
If we take the time difference t to be small we can put 
Aes ape (1 2 ae (13.14) 
dt C 
where Vr = ge = Ses a (13.15) 
dt v/(vt? + ¥8 + 2) 


is the radial component of the clock velocity at the instant the signal is emitted. 
From (13.08) and (13.14) we get 
ee C+ v; ; iy nt 
Ae a (13.16) 
This replaces (13.10) in the general case. 

The fact that At* and t+ are different is an expression of the well-known 
phenomenon of the Doppler effect, which is the following : if a periodic process 
of period t+ takes place in a moving frame then the period At* registered at some 
point in the rest frame proves to be greater than t, if the motion of the system 
is away from the point and less if the motion is towards it. Equation (13.16) 
gives the relativistic expression for the Doppler effect in the case of light 
propagating in vacuo. The classical Doppler formula is obtained if no dis- 
tinction is made between ¢’ and ¢ and At is replaced by 7 in (13.14). This 
introduces a relative error of order_v?/c?. 

One might remark that historically the first determination of the speed of 
light in empty space was performed in 1675 by Olaf Roemer using a method 
essentially based on the Doppler effect. The system of Jupiter’s satellites 
served as the moving clock and the periodic process observed consisted of their 
eclipses. During roughly half of its orbital motion the Earth approaches 
Jupiter, so that v, < 0, and for the other half of the time it moves away and 
Vr > 0. Using the simplified expression (13.14) instead of (13.16) one can take 
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for At the annua] mean of the observed time between eclipses. On that part of 
the Earth’s orbit on which v, < 0, the time A¢* will be less than At and the 
eclipses will be observed ahead of the expected time, while it will be greater 
on that part where v; > 0 and the eclipses will then come late. Thus the relation 
(13.14) allows a calculation of the speed of light c. In this way Roemer found a 
value of 3-1 x 10!° cm sec-!, very close to the now accepted value of 3-0 x 10?° 
em sec~}, which is obtained by much more accurate methods. 

Let us stress once again that in the theory of the Doppler effect two con- 
siderations are taken into account: firstly, the connection between the instant 
t* when the light signal arrives, and the instant ¢ of its emission, and, secondly, 
the relation between the time ¢ in the frame in which the signal is registered 
and the time ¢’ in the frame moving with the emitting body. The first of these 
considerations, passing from ¢* to ¢, introduces a factor depending on the radial 
component of the velocity ; it was already known in pre-relativistic theory. 
The second consideration, transforming from ¢ to ¢’, introduces a factor depending 
on the magnitude of the velocity ; it arises from the Lorentz transformation 
and is specific to the Theory of Relativity. 


14. Comparison of Clock Readings in Moving Reference Frames 
In the preceding section we examined a method of comparing time differences 
in different reference frames which relied on the use of light signals and took- 
due account of their time of travel. However, it is also possible, in principle, to 
use another method in which one only compares the readings of clocks when, 
in the course of their motion, they are in immediate proximity of each other. 
We imagine a row of clocks spaced out along a straight line and all belonging 
to the same frame. They are to be at rest in this frame and are supposed 
synchronized from the beginning. Suppose a clock A moves past the row of 
clocks. To obtain the rate of clock A it is sufficient to compare at any instant 
the reading of A with the reading of that base clock which A happens to be 
passing at the moment in question. Clearly it will be enough to consider two 
clocks in the base, so that we can think of the latter as formed of two clocks 
joined by a rigid bar. 
Referred to the base, the coordinates of the two clocks at rest and of the 
moving clock A will be 
x=a (first base clock) 
z= (second base clock) (14.01) 
x= vt (clock A) 
Using the Lorentz transformation formulae (13.04) and (13.05) we can introduce 
a reference frame connected with clock A. In it the coordinates of the same 
three clocks will be 


x’ = — vt’ 4-a’' (first base clock) 
az’ == —vt' +b’ (second base clock) (14.02) 
x= 0 (clock A) 


with the constants a’ and b' related to the a and b of (14.01) by 


a’ =a,/(1 -"), vy =0,/(1 -3) (14.03) 
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Without making any choice between the two reference frames we can register 
the following objective facts: first, the readings of clock A and of the first 
base clock at the instant when they are coincident and, second, the readings of A 
and the second base clock when in turn they coincide. We denote the two 
described readings of A by t and ¢, and by t, and ¢, the corresponding readings 


of the first and second base clocks respectively. Then we get, using (14.01) 
and (14.02) 


a , a 
Vv Vv 

, og (14.04) 
ly = - 3 by = 

UV UV 


f . Ff 2 
gma /(i—§); g=4/(1-9) (14.05) 


These results enable us to check, on the one hand, the rate of clock A observed 
from the base, and on the other hand, the rate of the base clocks observed from 
a frame fixed to A. We mean by the rate of a periodic process as observed from 
a particular frame, the rate expressed in terms of that time variable which is 
defined by the synchronization appropriate to the frame. 

The quantities ¢, and t, are readings of different base clocks, but as the two 
clocks are synchronized in the base frame, we can maintain that when the 
second clock read ¢t,, so did the first, using the word ‘“‘ when”? in the sense of 
that particular synchronization, Therefore the difference t, — ¢, is just the time 
elapsed in the base frame reckoning during which the reading of clock A 
advanced by ¢, —¢,. Thus the rate of clock A observed from the base is 
determined by the equation 


t, —t = (t, — adel (i — *) (14.06) 


Since A ai til<|t—4t |, clock A will be slow to an observer at the base. 
Putting ¢, — t; = +, and t, — t, = At, our equation coincides with (13.08) 


We now consider the rate of a base clock observed from a frame attached at A. 
To check this rate it is now essential to observe the readings of one and the same 
clock at base, either the first or the second. For definiteness let us consider the 
second. We have only one dtrect reading for it, namely the one for the instant 
when it was coincident with clock A, and then it read t,, while clock A read ¢,. 
The other reading of the second clock must be calculated from the available data. 
We therefore ask : where was the second clock and what was its reading when 
the first clock was coincident with clock A? It is essential to remember that 
the words ‘‘ where ” and ‘‘ when ”’ in the above must now be used in the sense 
appropriate to the frame of A, the primed frame. The question is readily 
answered. When the first clock and A coincided, the latter indicated the time 
t, =a’Jv, and then, according to (14.02) the second clock was at the point 
z’ = b’ —a’. The reading of the second clock is then obtained by inserting 
t’ =t, and «’ == b’ — a’ into the Lorentz transformation (13.05). 
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We get _ KEM —a)_ oy 
t= (1 — vc) => ty + oa? a) (14.07) 

2 
or eee ae ee Is (14.08) 


(The reading of the second clock now does not agree with the reading of the 
first, which is ¢,, because now simultaneity is understood in the frame of A 
and not of the base.) From the reading of the second clock, as given by (14.08), 
together with the directly observed reading ¢,, one finds the rate of the second 
clock in the frame of A. We have 


tp —t=(t.— Ay(1 ao “) (14.09) 


2 
whence, finally, t, —¢ = (4, —&) Ra. (14.10 
y 


Thus in the frame of A the clocks at base are again slow; we have complete 
reciprocity between the two frames. 

It is sometimes said that time passes more slowly in the moving frame than 
in the frame at rest. As the Principle of Relativity allows one to interchange 
the roles of the two frames, this formulation is contradictory. 

The nature of the kind of misunderstanding here involved can best be clarified 
by means of a mathematical illustration which has a direct bearing on our 
question. We saw in Section 10 that in a Lorentz transformation 

ay 1 
a 8 (1 — 0/2) 
but that for the reverse transformation also 
at 1 
— =—d@ So 
ot’ 0 A/(1 — 02/02) 


If one forgets that in (14.11) the differentiation takes place at constant 2, y, z 
and in (14.12) at constant 2’, y’ and z’, it seems strange that 2@t/@t’ is not the 
reciprocal of a€’/a but equal to it. It is clear, however, that there is in fact 
no paradox. 

Returning to the physical side of the matter one can say that the problem 
in question is not concerned with “ passage of time” in different frames but 
with the description of some localized process within different frames. Let the 
process be localized at a point at rest in the unprimed system so that 2, y and z 
are constant. Then we conclude from @’/dt> 1 that the duration, or period 
of the process in ‘‘its own’, unprimed frame will be less than in any other 
(primed) frame that is in motion relative to the former. If, however, the process 
is localized at a point with constant coordinates x’, y’, z’ then its “own”? or 
“proper ” system will be the primed one and we have é/dt’> 1, but the 
conclusion is the same. 

If the duration of the process in its “ proper” system was drt, then it will be 
dt > dt in another system moving with speed V relative to the first where 


dt = Jt = ‘) dt (14.13) 


>1 (14.11) 


>1 (14.12) 
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Here V is the velocity that enters the Lorentz transformation which relates the 
two frames. The magnitude of V is equal to that of v the velocity of motion of 
the point at which the process is localized. The components of v are 


d. a d. 
= Fa (14.14) 


Ve = 


We can therefore write, instead of (14.13) 


dt = Jt = ) dt (14.15) 


It is easy to see that this expression is invariant under Lorentz transformations. 
The quantity dt can be thought of as the differential of a “ proper time” + 


given by 
yt 
= yl — ") dt (14.16) 
0 


where we have assumed that t=0 at t =0. If the velocity v is constant 
then + is the duration as measured in its “ proper ” frame of the process associ- 
ated with the moving point-hence the name “ proper time”. If, however, v 
is a variable velocity the physical meaning of + is not obvious. Even so it can 
be looked upon as an auxiliary mathematical quantity which it is convenient 
to use because of its invariance under Lorentz transformations. One still retains 
the name “ proper time ”’ for it in the case of variable velocity v. 
_ If a moving body, a clock, is in a gravitational field which is the cause of its 
acceleration, one can give a different expression for the time shown by this clock, 
an expression which in addition to the velocity v, also involves the gravitational 
potential (see (62.03) ). The expression (14.16) will not be correct in this case. 
‘The correct expression may be justified in Einstein’s theory of gravitation, 
although it is not a direct consequence of that theory (see Section 62). As for 
the ordinary theory of relativity, it allows one to draw conclusions in a general 
form (i.e. without going deeply into the nature of the processes occurring) only 
in relation to unaccelerated motion. In general, however, no theory is capable 
of predicting how a clock will behave when subjected to impacts or arbitrary 
_ accelerations without entering into the details of the clock’s construction. This 
cannot be achieved by the theory of gravitation either; the expression given in 
Section 62 for the time shown by a clock in accelerated motion relates to the 
case when the acceleration is caused by the gravitational field. 

One could also introduce the notion of clocks nearly or, in the limit, entirely 
insensitive to impacts and accelerations (for instance atomic systems with large 
internal frequencies). One could then propose to measure proper time t by the 
readings of such an ideally insensitive clock, this being just the physical meaning 
of proper time. But one should note that such a proposal, although not in 
contradiction with the theory of relativity, does not follow from it and represents 
a special hypothesis. We shall return to this question in Sections 61 and 62. 
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15. Comparison of Distances and Lengths in Moving Reference Frames 

If an object is at rest in a given frame of reference the determination of its 
geometrical shape or dimensions need not be performed instantaneously. 
An object at rest may be measured gradually, by noting the positions of various 
points one by one. On the other hand, if one has to measure the dimensions 
and shape of a moving object it is absolutely necessary that the observations of 
all its points should relate to the same instant, otherwise one obtains a distorted 
picture. 

It is thus clear that the concepts of dimensions and shape of a moving object 
are closely related to the concept of simultaneity. We know already that the 
notion of simultaneity is not something absolute but depends on the reference 
frame. Therefore we must expect that the shape and size of an object is also 
not absolute, but must be stated in relation to a definite frame. 

As the simplest example, let us consider the length of a rod measured in 
different frames of reference. 

Let two frames be in relative motion in the direction of their common z-axis. 
Their space and time coordinates are then connected by a Lorentz transforma- 
tion which we restate : 

, Z—vU | F t — vale? 
% =A oye) 5/0 vie (15.01) 
y=y Zz) = 2 
Let the direction of the rod be the same as the direction of the relative velocity 
of the two frames—..e. as the x-axis—and let the rod be at rest in the unprimed 
system. At all times the coordinates of the ends of the rod in that system will be 
r= a, y=0, z2=0 (first end of rod) 
a y = 0, z=0 (second end of rod) 
and if b> a the length of the rod is 
l=b—a (15.03) 
In the primed frame, which moves relative to the rod, the coordinates of its 
ends will be 
gv = —vt' +a’, y’ = 0, 2’ =0 (first end) 
y =0, 2’=0 (second end) 


: v2 , 2 
where a’ =a (1 — “3 b’ = »/(1 -*) (15.05) 


The length of the rod is the distance between simultaneous positions of its ends. 
In the primed system simultaneity means equal values of ¢’ and distance is 
expressed in the usual way as a difference of primed coordinates. Therefore 
the length of the rod in the primed frame will be’ 


LO a! (15.06) 


, v 
Hence b= in/(1 ~5) (15.07) 


Thus, in the frame in which the rod has a velocity v in the direction of its length 


it appears shortened ; its length I’ is less than the length J obtained for the 
rod at rest. 


(15.02) 


(15.04) 
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If we were to take into consideration the transverse dimensions of the rod 
(those in the directions of the y and z-axes and perpendicular to the velocity), 
we could verify that they do not change. Consequently the volume of the rod 
decreases proportionally with its longitudinal dimension. The same result is 
true fora body of any shape. If V is the volume of a body in the system in which 
it is at rest, its volume V’ in a frame relative to which it moves with speed v 


will be 
j vy? 
v= vf -*) (15.08) 


Assume we have two identical rods parallel to each other with a relative velocity 
directed along their axes. Then, in the frame connected with the first rod the 
second will appear shortened, and vice versa. This state of affairs is the same 
as in the clock example and, as before, there is no paradox. The difference in 
the measured lengths comes from the different definitions of simultaneity. 
The positions of the ends of the rod that were simultaneous in one frame are 
not simultaneous, but only quasi-simultaneous in the other. Let x be the 
position of the end A at the instant tg and 2», the position of the end B at ty. 
The corresponding quantities in the other frame will be denoted by the cor- 
responding primed symbols. By the invariance of the space-like interval we have 


(xz, — ,)? — P(t, —t,)? = (a, — a,)? — c%(t, — 4)? (15.09) 
If we here put tg = tp we shall have ¢, ~ #, and hence 
7; =%|=|2, -4 (15.10) 


On the left we have the length of the rod in the unprimed system ; the quantity 
on the right will be the length of the rod in the primed frame provided it is at 
rest in it, for then it is immaterial whether 2/(¢’) and 2,(t’) refer to the same 
time ¢’ or not and we can take 2, = 2/,(t,) and x, = 2,(t,) with f, + &,. 

If we put ¢, =¢, we have ta # t& and hence 

|e, — x|>|2, —45| (15.11) 
but now the left-hand side is the length of the rod in the frame in which it rests 
whereas the right-hand side is the length in an arbitrary primed frame. 

In these considerations the symmetry of the equations with respect to 
primed and unprimed frames is evident from the beginning and, thercfore, the 
considerations themselves may appear trivial. We have given them in order 
to emphasize once again the closc relation between the concepts of simultaneity 


and of length. 


16. Relative Velocity 

In pre-relativistic mechanics the relative velocity of two bodies was defined 
as the difference of their velocities. Let the velocities of two bodies, both 
measured in the same frame of reference, be u and v respectively. Then the 
velocity of the second body relative to the first used to be defined as w = v — u. 
This definition is invariant with respect to Galileo transforinations but not 
Lorentz transformations. Thcrefore it is not suitable in the Theory of Rela- 
tivity and must be replaced by another. The fact that w = v — u has no 
physical meaning becomes evident by examining the following example. Let 
the velocitics u and v have opposite directions and have magnitudes near to 
the specd of light or equal to it. Then the “ velocity ” w will have a magnitude 
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near or equal to twice the speed of light, which is evidently absurd. 

We shall, therefore, give a new definition of relative velocity which is in accord 
with the requirements of Relativity and has a direct physical meaning. Let the 
velocities of two bodies in some frame of reference be u and v as before. We 
can introduce a primed frame of reference in which the velocity of one, say the 
first, body vanishes. Then we can interpret the velocity v’ of the second body 
in this frame as the relative velocity of the two bodies. We shall see that the 
magnitude of v’ will depend symmetrically on u and v, so that the so defined 
relative velocity of two bodies does not depend upon which body was chosen 
to be at rest in the new frame. 

To illustrate the physical significance of our definition we consider an example. 
Imagine we are observing two aeroplanes from the ground and let their velocities 
be u and v respectively. Assume that the first plane has radar equipment 
permitting a measurement of the speed of the other plane relative to itself. 
The velocity so measured will be the relative velocity of our definition. 

We must express this relative velocity in terms of the components of the 
velocities u and v of the two planes, as observed from the ground. For this 
purpose we write down the general formulae for a Lorentz transformation 
deduced in Section 10. These are 


r’ =r — Vt + (ao yes — V%), 


V2 
. a (16.01) 
(= Zoo t— ar 
and their inverse is 
, ‘ Vv , 
r=r’ + Vt’ + (@, — 1) V3 (V-r’ + V%’), 
ey (16.02) 
t => Zoo l + 2 
c 
where we have used the notation of (10.15), putting 
‘ 
(16.03) 


P00 Wl — Vier 


To have the first plane (the one of velocity u, relative to the ground) at rest in 
the primed system we must choose 


Vz = Uz; Vy = Uy; Vz = Uz (16.04) 


The velocity of the second plane measured from the ground is 


dr 
vas (16.05) 
while its velocity measured from the other plane is 
, ar 
vez (16.06) 


The relation between these two quantities is found by differentiating equation 
(16.01) and (16.02). We have 
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ye Vt oo — 1)(u/u2){(a-v) — u%} 


Pa EATS (16.07) 
and 
_V LU + go — 1)lu/u2}{(u-v') + 03} 
= => ayy (16.08) 
where, by (16.03) and (16.04) 
1 
oo = Va — wjo%) (16.09) 


Equation (16.08) is the solution of (16.07) for v. As the Lorentz transformation ‘ 
is linear with respect to space and time coordinates the primed components 
v,, v, and v, are linear fractional functions of the unprimed component vz, 
vy and zz. Let us calculate the square of the vector v’, i.e. the square of the 
relative velocity of the two planes. We have 

via (u — v)? — (1/e*)[u x vj]? (16.10) 

(1 — u-v/c?)? 
As we have already noted, this expression is symmetrical with respect to u 
and v. We shall now verify that the inequalities u? < c? and v? < c? imply 
v2 < ce? whatever the direction of the velocity vectors u and v. Indeed we have. 
v2 (1 — w{e2)(1 — ve?) 

a ae (16.11) 
Because u* < c? and v? <c? the right-hand side is always positive, so the 
left-hand side must also be positive and hence v’? < c?. In the limit when one 
or other of the velocities u and v is equal to light velocity the relative velocity 
v’ will also be equal to it. This agrees with our basic assumption on which the 
whole of Relativity Theory is based. 

It should be noted that the square of the relative velocity is invariant under 
Lorentz transformations. This means that if in (16.10) we replace the quantities 
u and v by the velocities u” and v” of the two aeroplanes relative to some other 
frame, say a third plane, the value (16.10) will not be affected by the change. 
It is evident that this must be so by the meaning of v’? as the square of a relative 
velocity. 

Equation (16.07) simplifies if in the given frame the two velocities u and v 
are either parallel or perpendicular. In the first case we get 


v—u 
~ Ll —u-y/e? 


f 


Vv 


{for [u x v] = 0} (16.12) 


and in the second 


v=v Jt — “| —u_ {for (u-v) = 0} (16.13) 


Equation (16.12) with the opposite sign before u is usually called Einstein’s 
Addition Theorem for Velocities. 

In the general case one can assert the following: If “velocity space” is 
considered as a Lobachevsky space the relativistic addition theorem for 
velocities coincides with the vector addition theorem in Lobachevsky geometry. 
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This will be proved in the following section. 


17. The Lobachevsky-Einstein Velocity Space 


Let us consider the relative velocity of two bodies moving with infinitesimally 
differing velocities v and v + dv. Let ds be the magnitude of this relative velocity, 
divided by c. Inserting u = v + dv into (16.10) and dividing by c? we get 


c%(dv)? — [v x dv]? 


(c? = vy? 


ds? == (17.01) 


or 
(c? — v?)(dv)? + (v-dv)? 


(c? = we)e 


ds? = (17.02) 


By the physical mcaning of (17.01) as being proportional to the square of an 
infinitesimal relative velocity it must be invariant under Lorentz transforma- 
tions. This can be directly verified, by expressing vz, vy and vz by (16.08) as 
fractional linear functions of v}, v, and v, This makes ds* the same function 
of (v,, yp v,) and (dv,, dv,, dv,) as it was of (vz, Vy, Vz) and (dvz, dvy, dvz). 

We shall consider (17.01) and (17.02) as the squared element of length in a 
certain velocity space. This is the space in which the velocity hodograph of 
ordinary mechanics is constructed. This space possesses all the properties of 
Lobachevsky space, and the velocity components vz, vy, vz divided by c¢ are 
the so-called Beltrami coordinates in it (see V. F. Kagan [9], eq. CVIII p. 453). 
The properties of Lobachevsky space can be derived by examining the 
expression (17.01) 

A curve in Lobachevsky space can be given by stating vz, vy and vz as 
functions of some parameter p. If the values p, and p, of p correspond to the 
end points of the curve, the arc length along the curve is given by the formula 


Ps 
s= | v(2F) dp (17.03) 
where a 
1 oy 1 (vv)? 
F=- aL Z 
jem age a (17.04) 


differentiation with respect to p being denoted by a dot. We determine the 
equation of a Lobachevsky straight linc, -i.e. of the shortest curve joining the 
points p, and p,. This is done by equating to zero the variation of the integral 
(17.03), or, in other words, by writing down Lagrange’s equations for the 
Lagrangian 


L=4+/(2F) (17.05) 

These equations are 
BN a de Loe ae 17.06 
ie ib, ty = 0, ete. (17.06) 


or 
d ( 1 =| 1 ck 
V (28) bz) (LF) ere 
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Up to now thie parameter p was arbitrary. We now choose it so that 


see F = const (17.08) 
ap : . 
With this choice equations (17.07) are equivalent to 
d OF oF 
i Bn De = 0, etc. (17.09) 
Since F does not contain p explicitly, we have 
oF | OF | OF , 
be a + ag oe ee (17.10) 


i.e. the quantity (17.10) 1s an integral of the Lagrange equations. But F is a 
honiogeneous quadratic function of vz, vy and %z, so that (17.10) is equal to F 
and thus condition (17.08) is a consequence of the Lagrange equations (17.09). 

We now proceed to write equations (17.09) in more detail and to find their 
integrals. We have 


oF tr (v-v) (17.11) 
— 0 i 
Cte ce = ye - (c? a v?)? 
oF v? 2(v-v)? viv 
ras > on a eS + Uz ae ; (17.12) 
Ovz (c2 — v7)? (ec? — 7*)8 (c? — v?)? 
We introduce a vector w with components 
We = id = wy = ra ee v2 = - (17.13) 
ce? 2a pz . Cc? — vv? ce? -- ey 
Then equations (17.11) and (17.12) become 
OF v-w) 
—— = tz -} 2 17.14 
Ovz ee ve ( ) 
oF v-w)? 
eat ase Pal + 2 va 2) “+ W2(V- Ww) (17.15) 
Oz ce —v 


Differentiating (17.14) with respect to the parameter p and expressing V in 
terms of w we get 
d OF veiw v-w)? 7 
—— = tz +e ( ) -+ va( w! 4.2 & = ") + wz(V-w) (17.16) 


dp dbz cF — v 


Inserting (17.15) and (17.16) into Lagrange’s equations (17.09) we see that 
they take on the form 


deg as vee (17.17) 


c? — vy? 


It is easily seen that as an algebraic consequence of these equations (17.17) 
we have (v + w) = 0 whence it follows that (17.17) 1s equivalent to 
fine Gop 0,. aS (17.18) 


or, vectorially, 
w- 0 (17.19) 
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Thus Lagrange’s equations amount to the requirement that the vector w 
(defined by (17.13)) should be constant. But w is proportional to V; if it is 
constant, so is the vector product 

[w x v] = const. (17.20) 


This gives two further linearly independent integrals of Lagrange’s equations. 

From the integrals found we see that the equation of a Lobachevsky straight 
line in velocity space is a set of linear relations between the velocity com- 
ponents vz, vy and vz. We know that Lorentz transformations lead to a frac- 
tional linear substitution for the velocity components. It is, therefore, evident 
that after a transformation the linear relations remain linear. 

We now determine the length along a Lobachevsky straight line between 
the points v = v, and v = vz, and also establish a connection between this 
length and the relative velocity of two bodies of velocities v, and v, respectively. 
As the coordinates of points on the straight line are given by linear relations 
they can be represented parametrically in the form 


Vv =v, + uve — Vi) (0<p< 1) (17.21) 
Inserting this expression for v into (17.04) we get 
c(Ve — Vi)? — [Vy X Ve]? “9 


2h = 
(c2 — v2)? 


(17.22) 


Putting for brevity 
a = v/{c*(v, — Vi)? — [v, X V2]?} (17.23) 


we obtain from (17.03) an expression for the length along the line from v, 
to Vv, 


ad 
5c jot = (17.24) 


This integral is evaluated most simply by means of the substitution 
(co? ~ vE 


L= = eee eer (17.25) 

es ab dE 1 b+a 
wes — | eee 585 ree 
where b=c? —v,'v, (17.27) 


The coefficients of the substitution were so chosen as to have 0 and 1 for the 
limits of — and so that the denominator in (17.26) does not involve the first 
power of &. Hence we obtain 


= tanh s (17.28) 


~~! 8 


and, after squaring and inultiplying by c?, this can be written as 
(v2 — v,)? — (1/e?\v, X v2}? 

[1 — (v,-v,/e*)]? 
Comparing this expression with (16.10) we see that the left-hand side is the 


= c* tanh?s (17.29) 
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square of the relative velocity. Thus the relative velocity is related to the 
length of the corresponding segment of the Lobachevsky straight line by 


|v’'|=ctanhs (17.30) 


Let us assume that the velocities v, and v, have the same direction. Expressing 
their magnitudes in terms of segments of a Lobachevskv straight line we have 


v, =ctanhs,; v, =ctanhs, (17.31) 


The length of Lobachevsky line corresponding to the relative velocity will be 
equal to the difference of the lengths s, and s,. Therefore, if s, > s,, the relative 
velocity will be 


tanh s, — tanh s, 


v = c tanh(s,— s,) =e (17.32) 


1 — tanh s, tanh s, 
Ve — Vy 
1 (v20,/c?) 
This is just the Einstein addition, or rather in our case subtraction, formula. 
Let us now consider the angle between the relative velocities of two bodies. 
If the velocities are taken relative to a point assumed at rest the cosine of the 
angle is given by the usual formula 


f 


or v (17.33) 


Vi" Ve 
[>] || 
but if the velocities are given relative to a point that is itself in motion with a 
velocity u, the angle between the relative velocities is given by a more compli- 
cated formula, which, however, can easily be obtained as follows. We make a 
Lorentz transformation to bring to rest the point that was moving with velocity 
u. Then we can apply the usual formula (17.37), getting 


Cos (V,, Vo) = (17.34) 


Vi'Ve 
fo] || 
with v, and v, denoting the velocities of the two bodies after the Lorentz 


transformation. These velocities are obtained from (16.07), replacing v by 
v, and by vz. Expressing v, and v, by v, and v., we obtain after some 


manipulation 


(17.35) 


COS & = COS (Vj, V3) 


(v; —¥) (v2 — 4) — (U/e*)Iv, x uliv, Xu) 
V{(v, —u)? — (1/e*)[v, x uf} V{(v2 — a)? — (1/c?)[v, x u}*} 
(17.36) 
This, however, is just the expression for the cosine of tle angle of a triangle in 
Lobachevsky space. (It is the angle at u in a triangle whose vertices are at the 


points u, v, and v,.) 
Indeed our initial expression (17.01) for the square of the element of length 


in Lobachevsky space has the form 

, 0%(dv)? —[v x dv]? 
SS a 

(ep 
This expression corresponds to a “ displacement ” dv from the “ point” v. 
For the displacement 5v from the same point we have 
c?(Sv)? — [v x dv]? 
(c? — v2)? 


cos a = 


(17.37) 


$32 = (17.38) 
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The cosine of the angle between the displacements can be stated in an invariant 
manncr by means of the equation 
c2dv 8v —[v « dv][v x dv] 
(c? = vy)? 

(This formula can be considered as the definition of an angle in Lobachevsky 
geometry.) If now in (17.36) we write v instead of u and if we insert into 
(17.39) the displacements 

dv = e(v,; — V); dv = 7(V. — V) (17.40) 
along two sides of a triangle starting from the vertex v, the expression for cos « 
obtained from (17.39) is the same as in (17.36). 

Thus the angle between relative velocities can be thought of as an angle 
in Lobachevsky geometry. Given three bodies moving with velocities v,, V, 
and v, the corresponding triangle has its vertices at the points v,, V, and Vy ; 
the relative velocities will correspond to the sides of the triangle and the angles 
between the velocities to the angles of the triangle. A similar construction is 
possible in non-relativistic kinematics, but there the geometry of velocity space 
is Kuclidean whereas in the Theory of Relativity it is Lobachevskian. 

The validity of Lobachevsky geometry in velocity space has its experimental 
verification. We refer to Fizeau’s experiment which measures the speed of 
light in a moving medium and to the phenomenon of astronomical aberration 
discovered by Bradley. 

Fizeau’s experiment aims at comparing the velocity of light propagation in a 
moving and in a stationary medium. The propagation of light in a medium is 
a rather complicated process involving the charges within the medium; a 
velocity c/n can be attributed to the propagation where n is the index of re- 
fraction of the medium. If the medium itself moves in the dircction cf the 
propagation with a spced v then w = e/n will be the speed of light propagation 
relative to the medium. The spced relative to a frame at rest can then be 
obtained by Hinstcin’s equation 

P w+ 
wy = —_____ 
1 -++ (wee?) 


Inserting here w = c/n and retaining terms of first and zero order in ¢ we get 


se ao 1 1 1 
uw = hn + 7] eas nm = + o(1 -<] (17.42) 


The coefficient of v bears the name Fresnel’s convection coefficient. 

The fact that this coefficient is not unity shows that the addition of velocities 
must be performed according to Hinstein’s addition formula, corresponding 
to Lobachevsky geometry, and not according to the pre-rclativistic rule based 
on Euclidean geometry. 

The phenomenon of astronomical aberration consists basically in the fact 
that in two frames in relative motion the directions to one and the same star 
do not coincide but difler by the aberration. To find this quantity one should 
construct a Lobachevsky triangle with vertices at the points v,, v, and v, = ac 
where v, and v, are the velocities of bodies fixed in the two frames and a is a 
unit vector in the direction of the light wave emitted by the star. In the triangle 
Vi, Vp 5Vs the angle at the vertex v, will be zero (see equation (17.44) below) 


(17.39) 


ds 88 cos w = 


(17.41) 


2 
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and the sum of the two angles at v, and v, will be less than two right angles by 
the magnitude of the aberration. (If the geometry were Euclidean the sum of 
these angles would be equal to two right angles.) 

The required trigonometrical calculation can easily be performed using the 
formula 

(V2 —Vi)(Vs — Vy) — (1/c®)[v, x vi][v3 x vi] 
V{(¥2 — Vi)? — (1/e*)[Ve x Vi ]} -4/{(vs — vy)? — (1/c*)[v x v,]?} 
(17.43) 

and two other formulae obtained from (17.43) by cyclic permutation of the 


suffixes 1, 2 and 3. 
First of all, since v2 = c?, we have 


cos oy = 


cos a, = 1; a, = 0 (17.44) 

We then choose a frame such that 
Veodvese0 (17.45) 
and put \v,| = |v. =v; av, = —a-v, = 0088 (17.46) 


The relative speed of the two systems will be 
2u 


P12 = 1 + vet (17.47) 
From (17.43) we get 
_— 2 __ 4y2 4 
cos a, = eee sin a, = {v(c? — v*)} sin 8 (17.48) 
e —vcos8 c —vcos 8 
and, by analogy, 
2 _ 42 7 
i, Beg «a 
e+ vcosB e+ vcosB 
Denoting the magnitude of aberration by 8 we can put 
8 = T — 4 — hy — Og = KF — a — hy (17.50) 
The previous equations give 
: 2v? sin? B 
2 sin? x) =1 + cos (a, +2) = c? — v® cos? B (17.51) 
v sin B 
whence tan a) = Vet — v8) (17.52) 


In astronomical observations one compares the apparent position of a star 
at times of different direction of the Earth’s orbital motion (annual aberration). 
Since only the relative velocities of the bodies that receive the starlight enter 
our considerations, the overall motion of the Solar System relative to the stars 
considered will evidently have no effect as long as its velocity is constant during 
the lengths of time involved. It is, therefore, not possible to determine the 
velocity of a star relative to the Earth or the Sun by observing its aberration. 


CHAPTER II 


THE THEORY OF RELATIVITY IN TENSOR FORM 


18. Some Remarks on the Covariance of Equations 

In Section 6, when we discussed the basic postulates of the Theory of 
Relativity, we established certain general requirements to which any equations 
must conform, which determine the course of physical processes and do not 
involve initial conditions. These requirements fix the rules according to which 
such equations must transform on passing from one inertial frame to another. 

According to the Principle of Relativity the rules for transforming the 
independent variables and the unknown functions entering the equations 
must be such that the equations stated in one inertial frame are equivalent 
to equations of the same form in any other inertial frame. 

We have already used this requirement in discussing the Lorentz trans- 
formation. Indeed, we obtained this transformation from the condition that 
the equation for wave front propagation 


1 [éw\? dw \ ? dw \ ? Aw \ ? 
tee) ~ (Ce) + (5) +0) |=8 ore 
should preserve its form and also that uniform rectilinear motion should 
remain uniform and rectilinear in all inertial frames. Thus the rules for trans- 
forming the independent variables, i.e. the space and time coordinates, have 
already been fixed, by the above requirements, but we have yet to consider 
the transformation rules for the unknown functions occurring in the equations. 
The simplest transformation rule is simple invariance. For instance, in the 


case of equation (18.01) w is the unknown function. It is determined by the 
wave front equation 


w(x, y, 2, t) = 0 (18.02) 

The transformed function w’, giving the wave front equation in the new 
variables 

w'(x’, y’, 2’, ) =0 (18.03) 

is simply equal to the old function; its form is determined by the condition 

w’(2’, y’, 2’, V’) = w(2, y, 2, t) (18.04) 


However, in the majority of problems the form of an equation is only con- 
served if the Lorentz transformation on the independent variables is accom- 
panied by certain transformations of the unknown functions. If a trans- 
formation exists such that the new functions, expressed in the terms of the 
new variables, satisfy equations of the same form as do the old functions in 
the old variables, the equations are termed covariant. 
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The requirement of covariance of equations under Lorentz transformations 
is a necessary consequence of the Principle of Relativity. On the other hand, 
it is clear that not all equations are covariant. We shall have to examine the 
equations used in the physical description of various processes, e.g. the equations 
of electrodynamics and of mechanics, in order to decide whether they are 
covariant or not. If some of them are not, we shall have to alter them to a 
covariant form. 

A preliminary to investigating covanance and to formulating covariant 
equations is the study of the quantities with the simplest transformation 
laws. Only lnear laws need be considered. The complete classification of 
quantities according to their transformation laws is a question for Group 
Theory. We cannot here deal with it in its full generality and shall confine 
ourselves to what is essential for the formulation of the basic equations of 
mechanics and electrodynamics. 


19. Definition of a Tensor in Three Dimensions and some Remarks on 
Covariant Quantities 


The equations describing a rotation of the axes of a spatial coordinate 
frame are 


k=l Gi, k = 1, 2,3) (19.01) 


where the a, are the direction cosines relating the old and the new axes. 
Formally we can define a three-dimensional vector as a set of quantities (A,, 
A,, As), called components, which transform according to 


3 
Ai = > oA (19.02) 
k= 


1 
when the axes are rotated. A special case of a vector is the coordinate vector 
for which A, —2,, Ap = 2, and A, = 75. 
Let us assume that two vectors are given and that their components 4; 
and B, are connected by the linear relation 


3 
Bye= > TuAr (19.03) 


After a rotation cf the axes the new vector components will be connected 
by the analogous relation 


3 
B= >, Ye (19.04) 
k=1 
with 
3 ~_ 
oe = > aygenl jr (19.05) 
j,l=1 


For the old and the new components of the vector By; must be connected by 
the same transformation as those of Ay. If first we express the B; in terms of 
the B;; and then in (19.03) express the 4; in terms of the Aj, we obtain (19.04) 


and (19.05). 
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Thus the set of quantities 7’, transforms according to (19.05) when the 
axes are rotated. Such a set is called a tensor, or more precisely a tensor of 
the second rank. (In general the rank of a tensor is equal to the number of 
suffixes attached to it.) 

Examples of tensors occur even in non-relativistic mechanics. Thus the 
components of the angular momentum of a rigid body (B,) are connected 
with the components of its angular velocity (A,) by relations of the form 
(19.03) where 7, is the inertia tensor. Another example occurs in the theory 
of elasticity, where similar relations give the connection between the com- 
ponents (dFz, dFy, dF ,) of the force on some surface element and the pro- 
jections (dSz, dSy, dSz) of the surface element on to the coordinate planes. 
In this case the set of coefficients is the stress tensor. In both these examples 
the tensor Ty, is symmetric in its suffixes, but tensors also exist which do not 
have this property. 

Equation (19.05) shows that the components of a tensor transform like the 
products 25, of the coordinates (x,, x2, 3) and (€,, &,, £3) of two points, which 
may be coincident (a; = &). 

Let us consider an antisymmetric tensor, i.e. one whose components satisfy 
the relation 

Tae +Te = 0 (19.06) 
An example of such a tensor is the set of antisymmetric combinations formed 
from the products of the coordinates of two points 


Pix = vibe — Ever (19.07) 
As the suffixes we are here concerned with can only take on the three values 
(1, 2, 3), we can replace any pair of suffixes by the single missing one and put 
P.3 = —Ty,=T, 
T3,= —T,,=T, (19.08) 
Py.= — Ty, =T; 
We introduce the antisymmetric system of quantities defined as 
ext = 4-1, if (2, 4, l) is an even permutation of (1, 2, 3); 
eikt = —l, if (2, k, l) is an odd permutation of (1, 2, 3); 
eixt = 0 if any of the suffixes are equal. | 
Then (19.08) may be written as 


1 
T, => 3 > capil ix (19.09) 
“k=1 
The properties of the <q: ensure that in fact the sum in ( 19.09) has only two terms 
and if (7, #, /) is an even permutation of (1, 2. 3) the previous equation may be 
written as : 


T1=T (19.09%) 
Using the antisymmetry condition we can replace (19.05) by 
jo eek 
Te =5 2s (xipttng — cugetep)T pg (19.10) 


In this sum three of the nine terms vanish, namely those with p=g. The 
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other SIX are equal in pairs, so that only three essentially different terms occur. 
Let (2, k, 1) and (p, g, r) be even permutations of (1, 2, 3). Using the notation 
(19.09) and putting 
- ipl — Kiqkkep = Bir (19.11) 
we can rewrite equation (19.10) as 


3 
T = >6,,T, (19.12) 
Let us now consider the determinant , 
X11 Aye Ag 
A =| oo; ep tes (19.13) 


X31 Xg2 Aggy 


By the orthogonality of the transformation A? = 1, and, in particular, if the 
transformation is purely a rotation of the axes A = +1, while if the rotation 
is accompanied by a reflexion, i.e. a reversal of one or of three axes, A = —1. 
In both cases 

Bip = A> a (19.14) 


Thus for a simple rotation of the axes we have 
3 
T, = da, (19.15) 
r=1 


and for an “improper orthogonal transformation”, i.e. a rotation with 
reflexion, 
3 
T= ee > oy, , (19.16) 
r=1 

. Equations (19.15) and (19.16) show that in a pure rotation the set of quantities 
(19.08) transforms as a vector and in a rotation with reflexion their transfor- 
mation is the same as for a vector except for a change of sign of all components. 
A set of quantities having this transformation law is conventionally called an 
axial vector as distinct from a polar vector which is one that obeys (19.02) in 
all transformations. The vector product of two polar vectors is easily seen to 
be an axial vector. To give physical examples (in terms of three dimensional 
vector algebra), the vector of electric field intensity is polar and the vector of 
magnetic field intensity is axial. Equations (19.08) show that an axial vector 
is in essence a tensor of the second rank, therefore if the terms “ vector’ and 
“tensor ” are used in the sense of definitions (19.02) and (19.05) one can do 
without the term “ axial vector ”. 

Tensors of higher rank in three dimensional Euclidean space can be defined 
similarly. For instance, a tensor of the third rank is a set of quantities 7;,,, 
which transform on rotation according to the law 


3 

= 2% jm%nl mn (19.17) 

l,m,n=1 
Further, a quantity that does not change when the axes are rotated can also 
be considered to be a tensor, namely a tensor of zero rank. Such a quantity 
is called a scalar or an invariant. 

Given a tensor of the second rank one can find a linear combination of its 
components which does not change when the axes are rotated, and thus is a 


ajk 
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scalar. The linear combination having this property is the sum of the diagonal 

elements of the tensor, i.e. the quantity , 

Tas Ts (19.18) 
i 


Using the property > oye = Sy (19.19) 
t 


of the coefficients of an orthogonal transformation one verifies easily that 
(19.05) leads to 

Tl =T (19.20) 
so that T is a scalar. Similarly, from the components of a third rank tensor, 
one can obtain three vectors, namely 


A, a 2 Pimm } B, = 2, Pinim > C; = 2 Pinmt (19.21) 
™ ™m ™m 

One can ask whether it is possible to replace tensors by other quantities of 

various ranks in such a way that it is no longer possible to form from compon- 

ents of one rank any linear combinations that transform like quantities of 

lower rank. 

This can indeed be done and the resulting quantities have the transformation 
properties of solid harmonics. We recall that a solid harmonic Pim(z, y, 2) 
is a harmonic polynomial which, expressed in terms of spherical polar co- 
ordinates 


r=rsn dcosg; y=rsindsin g; = cos > 
becomes a product of 7’ with a surface harmonic 


rt Yims ) = Prin, Ys z) 

The order J is equal to the rank of the corresponding tensor, but the number 
of linearly independent ‘‘ components ”’ will be less than for a tensor. If the 
coordinate frame is rotated the new polynomial Pim(zx’, y’, 2’) can be expressed 
as a linear combination of the old polynomials Pim(z, y, 2), all with the same 
l, and with m taking on the values m = —l, —I+1,...,1. The coefficients 
of this lmear transformation determine the desired transformation law for 
quantities of rank 1. 

One final remark: We have assumed in our discussions that the tensors 
or related quantities are, by their physical meaning, completely determined, 
including their signs. Correspondingly, the coefficients in our transformation 
formulae were single-valued functions of the cosines a. However, one can also 
imagine quantities that are only defined except for a sign, so that only quadratic 
expressions in them are fully determined. In this case the coefficients of their 
transformations need not be uniquely fixed by the a but may instead be chosen 
to be determined only apart from their sign (the same in all coefficients). This 
leads to a new type of physical quantity, the so-called spinor, and to a cor- 
responding generalization of spherical harmonics and their transformation 
law. Such quantities are used in quantum mechanics, 

Tensors are thus not the only geometrical quantities with a definite transfor- 
mation law, but in ordinary, non-quantal applications of the Theory of 
Relativity the consideration of tensors is sufficient. 
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In the preceding section we summarized the definition of tensors in Euclidean 
space. We must now generalize this definition to apply to the four-dimensional 
space-time manifold. The part played before by rotations of the axes is now 
taken over by Lorentz transformations. An essentially new feature is the 
difference of sign between the space terms and the time term in the expression 


ds® = cdt? — dx? — dy? — dz? (20.01) 


which gives the square of an infinitesimal four-dimensional interval. We have 
seen that the invariance of (20.01) characterizes Lorentz transformations in 
the same way as the invariance of 


dl? = dx? + dy? + dz? (20.02) 
characterizes rotations of spatial axes. 

The difference in sign is extremely important for the whole theory because 
it reflects the existence of a deep distinction between space and time. 

It would be possible to impose the same sign on all terms of ds? by introducing 
imaginary quantities—imaginary space coordinates or imaginary time. This 
course was adopted by Minkowski and by Umov ; but we believe that it is not 
appropriate to introduce a symmetry between space and time into our equations 
in this manner because it obscures the actually existing distinction between 
them and does not have any mathematical advantages. Therefore we shall 
continue to use real space variables and a real time. 

If we put 

b= F755 L= 2; Y = 2%; z= Zy (20.03) 
the expression for ds? becomes 


3 
dst = day — det — dag — deg = 2 e, de (20.04) 


where the quantities ¢, are equal to +1 (see (8.25)). The direct and inverse 
Lorentz transformations can be written as 


3 
r= 2 C04 Ly, (20.05) 
=0 


3 
and Dy = > Cy Oy hy (20.06) 


Therefore the differentials of old and new coordinates will be connected by 
3 
dx, = > €,0,,dx, (20.07) 
k=0 


and the partial derivatives of any function ¢ with respect to the old and the 
new coordinates by 
Ot > aa (20.08) 
an, ~ 2, Ok By 
If the quadratic form (20.04) were definite, all the numbers ex, would be equal 
and so would the coefficients in (20.07) and (20.08). In fact, we have 
Pe i 6S 5 65 — 1 (20.09) 


and some of the corresponding coefficients in the two equations differ in sign. 
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In contrast to the case of purely spatial rotations, the transformation laws are 
different for coordinate differentials and for partial derivatives with respect 
to the coordinates. This must be remembered when defining a vector. We 
must distinguish between vectors whose components transform like the partial 
derivatives of the coordinates and those for which they transform like co- 
ordinate differentials. The former will be called covariant, the latter contra- 
variant. We shall write the components of a covariant vector with suffzes, 
or lower indices, and the components of a contravariant one with superfixes, or 
upper indices. Thus the transformation law for a covariant vector is written as 


3 
eee 2 
A,= >, €:444,4, (20.10) 
k=0 
and for a contravariant one as 
A‘ = * epaipaA * (20.11) 
k==0 


These relations can also be expressed as 
3 


on, 
= a: 20.12 
A= > aA (20.12) 
k=0 
3 ax’ 
and yee >: — Ak (20.13) 
= OL, 


One should note that the coordinates and their differentials are exceptional 
in violating the rule of index positioning. The notation dro, dx, dr,, dx, 18 
customary although by the general rule one should write (dx)°, (dz)', (dx)? 
and (dz). 

If a covariant vector A; is given we can always introduce a corresponding 
contravariant vector by putting 


At= eA, (20.14) 
The two vectors are not essentially different and we shall speak not of two 
vectors but of the covariant and contravariant components of the same vector 
and denote them by the same letter. 
The scalar product of two vectors A; and B; is defined as the sum 


3 3 
> eA By = > e,AtB!t (20.15) 
t= t= 
It can also be written as : : 
3 3 
S AiBt= > AB, (20.16) 
1=0 1=0 


This quantity is invariant under Lorentz transformations. 
The scalar product of a vector with itself 
3 


2 AtAt = (Ag)? — (Ay)? — (A)? — (A)? (20.17) 


1=0 
can be positive or negative. A vector for which (20.17) is positive is called 
“ time-like” and a vector for which this expression is negative is called “ space- 
like ”’. 

As an example of a time-like vector we can mention the four-dimensional 
velocity, whose components are defined by the equations 
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iO te eS 

Va = ve?’ Vl — vFe2) 

The fact that this set of quantities is a vector can be seen as follows : We know 
that the quantity 


(¢ = 1,2,3) (20.18) 


edt = ds = {y/(c? — v®)} dt (20.19) 
is invariant ; on the other hand the quantities (20.18) can be written in the 
form 

yu® = ot = dx, : u= dary 
dt dr’ 
Hence it is clear that u®, u}, uw? and u® transform like the differentials of the 
coordinates 2», 21, 2, 23. They are thus components of a contravariant vector. 
That the velocity vector is time-like follows from the identity 


(i = 1,2, 3) (20.20) 


(u°)? — (w1)? — (w?)? — (v3)? = c? (20.21) 

The four-dimensional vector of acceleration whose components are defined by 
0 dut 

ff ee Poe a Oe (20.22) 
dt dt 


is an example of a space-like vector. In more detail, we can write 


Pe 5 sam 
VW — v®/e%) di \a/(1 — v2/c2) ies 
ye 1 a) (i = 1, 2,3) 
~ a/(1 — vc?) dt\s/(1 — v2/e2) de as 


The space-like nature of this vector is proved by differentiating the identity 
(20.21) with respect to time, or to t, which gives 
uw? — uw) — u?w? — udw? = 0 (20.24) 
or cw? — vw! — vw? — v,w® = 0 (20.25) 
Hence we get the inequality 
2 


(0)? <2 {(w)? + (w®)* + (w9)2} < (wh)? + (w2)® + (w9)® (20.26) 
c 
which proves our statement that w is space-like. 


21. Four-Dimensional Tensors 
One can define tensors of the second and higher ranks in the four-dimensional 
space time manifold just as in three-dimensional space. 
A set of quantities transforming under Lorentz transformations according 
to the law 


2. ax, Oa 
— et ee 21.01 
lin = 2D, Gut aT oo 
j,t=0 
3 
or i CLA T 4) (21.02) 
j,1=0 


s called a covariant tensor of the second rank. Similarly a set of quantities 


62 The Theory of Space Time and Gravitation 


transforming according to 


2. ax, Oz; 
uk — YS —! "TI 21.03 
L py Ox, Ox ( ) 
jt=o J 1 
3 
or i > ejerayaxi lt (21.04) 
j,t=0 


is called a contravariant tensor. Lastly, one can also define a mixed tensor of 
the second rank as one that is covariant in one index and contravariant in the 
other. Its components transform according to 


3 ‘ 
many i 7 (21.05) 
z ax, Ox, 
t=0 
3 
or T= 5 C4 f2y 11} (21.06) 
j,t=0 


If all four numbers e; were equal, the coefficients in (21.02), (21.04) and 
(21.06) would be the same, in fact there are sign differences between corres- 
ponding coefficients. 

The transformation formulae (21.02), (21.04) and (21.06) are not essentially 
different, and one can easily pass from quantities transforming according to one 
of them to others transforming according to either of the other two. To do 
this one simply puts 


Te — cep Ty (21.07) 

while 7¢ may be defined either as 
y ieee ae (21.08) 
or as T' =e,T,, (21.09) 


If the tensor is symmetric the quantities (21.08) and (21.09) are the same. 
Just as for a vector, we shall not speak of different tensors but of covariant, 
contravariant and mixed components of one and the same tensor. 

Similarly, tensors of the third and higher ranks may be introduced. For 
example a tensor of third rank with three covariant indices transforms accord- 
ing to the law 


3 
Ox, Ox, Ox 
Tem = >» i aie eae (21.10) 
tk , 7 7 l 
" 1 ag OM he OL ry itt 
3 
or Tikm = >, Cee mAyjAKOmnal jin (21.11) 
j,l,n=0 


As in the three-dimensional case, one can form a scalar from components of a 
second rank tensor, namely 


3 
T = > alu ‘ (21.12) 
1=0 


and three vectors can be constructed from the components of a tensor of the 
third rank : 


A= decTur; Bi= Deeb; Ce= DerTent (21.13) 
k k k ; 


We shall not dwell on the question touched on at the end of Section 19 con- 
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cerning quantities other than tensors whose transformation laws are the simplest 
in a certain sense. This is because in all non-quantal applications of the Theory 
of Relativity the main role is played by tensors. 

Given two tensors other tensors can be constructed of a rank equal to the 
sum, or to the difference, of the ranks of the given tensors and also tensors 
of ranks intermediate between these, but of the same parity as the sum and 
difference. We shall show this by examples. In Section 20 we saw that two 
vectors A; and B;, can form the scalar 


C= 5 eAiBy (21.14) 


1=0 

Vectors are to be thought of as tensors of the first rank and scalars as tensors 
of zero rank. We can also use the same vectors to construct two tensors of 
second rank, namely 


Cin = ArBy and Cu = BiAr (21.15) 
Given a tensor of second rank Ty, and a vector Aj, the quantity 
B= > ex ArT (21.16) 
k 
will be a vector and the quantity 
Cin = Ail en (21.17) 


a third rank tensor; so will the quantities obtained by interchanging suffixes 
on the right of (21.17). 
As a last example we take second rank tensors Ty, and Uy, and form the scalar 


3 

C = > eer xl ix (21.18) 
the second rank tensor 1,k=0 

and the fourth rank tensor Cu = 2 emPimUs sid (21.19) 
In this case, evidently Corm = TueUim or) 
Cx => > énCimntn (21.21) 

and ™m 
C= > eCu (21.22) 

i 


The tensor character of all these quantities is readily verified from the properties 
of the coefficients of a Lorentz transformation as given by (10.04) and (10.05). 

It should be observed that in all sums containing products of covariant 
tensor components these products are multiplied by the sign factors, 1, ek, - - - 
where 1, k, etc., are suffixes over which a summation is performed. Instead one 
can write these sums so as to have each index occurring once as the covariant 
suffix of one tensor, and the other time as the contravariant superfix of a 
second tensor. In-this form the sign factors do not appear. Thus, for example, 
the scalar product (21.14) can be written in the form (20.16) and the sum 
(21.18), using (21.07), as 


3 
C= > THRU tr (21.23) 
1,k=0 _ 
Given a tensor, other tensors can be obtained not only by multiplication by 
a second tensor but also by differentiation. The operation of differentiating 


with respect to a coordinate a then appears in place of multiplication by a 


\ 


64 The Theory of Space Time and Gravitation 


covariant component of some vector. For instance, the quantity 


8. ad, At 
C= >, a5, = >. a (21.24) 
is a scalar and the set of quantities : 
B, = 2, ans (21.25) 
is a vector (the divergence of the tensor 7%). If in particular 
a (21.26) 
0x4 


2 2 2 2 " 92 
way ee ee (21.27) 
4 ox} ox? ox? ex? ox? 
1=0 0 1 2 3 


must also be a scalar. The symbol C is the d’Alembert operator. 


22. Pseudo-Tensors 


In addition to tensors it is convenient to consider other quantities whose 

transformation law depends on the sign of the determinant 
\ 
Dis oe) hee leas) (22.01) 
D(x, Ty; Ve, 3) 

of the transformation. The properties of the coefficients in a Lorentz trans- 
formation ensure that the square of D is always unity. D itself will be +1 
for proper Lorentz transformations, i.e. those which preserve the sense of time 
and also the right- or left-handedness of the spatial axes, but —1 for improper 
transformations. Although we agreed to consider proper transformations 
only, it is useful for the classification of geometric quantities to know also their 
behaviour for improper transformations. 

We consider a set of quantities egim, antisymmetric with respect to all 
suffixes and with € 55 = 1. It follows that if two or more suffixes are equal 
€tktm Maust vanish, that exim = +1 if (i 1m) is an even permutation of (0 1 2 3) 
and that egim = —1 if it is an odd permutation. 

The identity 


pars aa aT a ar =D etklm (22.02) 
pars=0 
is readily verified, for the left-hand side is the expansion of the determinant 
OXy OLy OLy OLg 
Gat Be, Be Bi 
0X3 OL, OL, OX, 
Ox, Ox, Ox, Ox}, 
which is obtained from D by permuting columns, provided all four suffixes 
(1 kl m) are different, and which vanishes otherwise. 
Equation (22.02) shows that if D = 1 the ¢ transform like components of a 
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tensor, but that if D = —1 their transformation law differs in sign from the 
tensor rule. A set of quantities having such transformation laws is called a 
pseudo-tensor. 

We can consider the ex1m to be covariant components of an antisymmetric 


pseudo-tensor of fourth rank. Its contravariant components are, by the 
general rule, 


etklm — CiC KCC mEtk In (22.03) 
but since the ¢ are only different from zero if all suffixes are different, in which 
CASE CslKE1em = Co€\€2€3 = —1, we have more simply 

etklm — —eypign (22.04) 
If 9 is a scalar the set of quantities 

Ptkim = PEtkim (22.05) 


is an antisymmetrical pseudo-tensor of fourth rank. Such a pseudo-tensor 
has only one component, just like a scalar. Hence it is customarily known as a 
pseudo-scalar. 


To every antisymmetric tensor Ay of the second rank one can adjoin an 


* 
antisymmetric pseudo-tensor A** of the same rank by the equation 
* iL 
Atk = » etklm Arm (22.06) 
l, m=0 
It is called the dual of the given tensor. Only two terms in the sum (22.06) 
differ from zero and these terms are equal, therefore we have 


* * * 
AMX = A,;; A®= A); A*°= Aj, (22.07) 


* * * 
A® = Aig; A®! = Ag; A® = As 


Similarly there corresponds to any antisymmetric tensor of the third rank a 
pseudo-vector—i.e. a pseudo-tensor of the first rank, given by the equation 
* Tee 
At= etl A 1m (22.08) 
6. i,m=0 
There are six non-vanishing terms in the sum, but all six are equal. Therefore 
we obtain the explicit form 
* * * * 
A® = —Ajas; At = Ags; A? = Asi; A = Ajo9 (22.09) 

for the components of the pseudo-vector. 

We shall refer for short to the “ product of a tensor with a tensor ”, meaning 
a tensor obtained by multiplying the components of two given tensors accord- 
ing to the rules of Section 21. Then we can say that the product of two pseudo- 
tensors, like the product of two tensors, is a tensor, but the product of a 
tensor with a pseudo-tensor is a pseudo-tensor. We have, in fact, already used 


* * 
this rule in forming the pseudo-tensor A‘* and the pseudo-vector At. 


23. Infinitesimal Lorentz Transformations 
The general Lorentz transformation, including a shift of origin, has the form 
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3 
;, = a; + > exaeere (23.01) 


k=0 

We examine the case when this transformation is only infinitesimally different 

from the identity. In this case the constants a, are infinitesimal and the coefli- 
cients a4, may be written as 

Qin = exdiz + wth (23.02) 


where the «w** are infinitesimal. To stress the fact that the coordinates form a 
contravariant vector, we shall, in this section, write them with superfizes. 


2° = ct; Freee r= y, 2 (23.03) 


We shall likewise write the infinitesimal constants describing the shift of origin 
as a‘ instead of a. In this notation the result of inserting (23.02) into (23.01) 
is 
3 
vt=a2ttat+ > exwtkgk (23.04) 
k=0 
For an infinitesimal transformation the distinction between the given and the 
transformed frame is not important and therefore the differences 
Agt = g't — at (23.05) 
can be treated as the components of a vector referred to either system, say to 
the first one. So the expressions 
3 
Azt = at + > exwtkxk (23.06) 
k=0 
represent an infinitesimal contravariant vector. Hence the constants at them- 
selves form such a vector and the quantities w‘* a contravariant tensor. This 
tensor is antisymmetric, for on inserting (23.02) into the orthogonality con- 
ditions for the coefficients of a Lorentz transformation 


3 
> C1sedat = €xSk1 (23.07) 
i=0 
and neglecting terms of second order in the infinitesimals we get 
afk + wit = 0 (23.08) 


An antisymmetric tensor of rank 2 has six independent components. These 
six quantities together with the four components of the infinitesimal clisplace- 
ment vector a’ are the ten independent parameters that characterize the 
infinitesimal Lorentz transformation. 

Let us perform two infinitesimal transformations consecutively, one with 
the parameters a, w‘*, the other with parameters bt, of*. It is easy to see that, 
neglecting second order terms, the combined result of the transformations is 
equivalent to a single transformation with the parameters 

ct=at+bt; Yk oatk + otk (23.09). 
This shows in particular that the result of two infinitesimal transformations 
does not depend on the order in which they are performed. We can therefore 
consider a displacement of origin, a change to a moving frame and an infinitesimal 
rotation of the axes separately and in turn. 

Putting 

= <c: at=az; a*=ay; a® =az (23.10) 
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we get a transformation describing a change of time origin by + to an earlier 
instant and a shift of spatial origin backward along the vector a and a distance 
equal to the magnitude of a. The passage to a frame moving with an infinitesi- 
mal velocity V is represented by the transformation 

“gf —g = Ax = —Vut 
y —y=Ay=—VD,t (23.11) 
2 —z=Az= —Vz 
of the space coordinates together with the transformation 


, 1 


of the time. 
Going from three- to four-dimensional symbols and comparing coefficients in 
(23.11) and (23.06) we get 


mu Vy, 


@i? — . wo? = — —": @3° = — V. (23.13) 
c c c 
On the other hand, comparing cvefficients in (23.12) and (23.06) gives 
@? — Me @ 9? = Vu, @ 3 — V: (23.14) 
c c ¢ 


@ result that is obvious from the antisymmetry of the tensor w**. 
Let us finally consider an infinitesimal rotation of spatial axes. By a well- 
known formula in kinematics we have 


Ar =[w Xr] (23.15) 
where w is the vector of an infinitesimal rotation, having components equal to 


the angles of rotation about the axes of x, y and z. 
Comparing (23.15) with (23.06) we get 
w23 = —w3? = wz; 0 = —w = wy; wl? = —w2l = w, 
(23.16) 


We also write down the covariant components wy. Lowering indices by the 
usual rule we get 


Vz Vy Vz 

(a) =, ——? G) = (a) _ 
10 c’ an c’ ea te (23.17) 

Weg = Wz; a1 = Wy; O12 = Wz 


with all other components following from antisymmetry. 

In conclusion we remark that it is possible to obtain the equations of finite 
Lorentz transformations from a study of the infinitesimal transformations, but 
we shall not discuss this question here. 


24. The Transformation Laws for the Electromagnetic Field and the 
Covariance of Maxwell’s Equations 

Our derivation of the basic laws of the Theory of Relativity started from the 
sropagation law for an electromagnetic wave front. This law is a result of 
Vlaxwell’s equations as was shown in Section 3. We must now verify that 
Viaxwell’s equations are really covariant with respect to Lorentz transformations. 
[o do this we must first find the law according to which an electromagnetic 
ield transforms in going from one reference frame to another. 
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It is well known that the electromagnetic field can be expressed in terms 
of a scalar and a vector potential by the equations 


120A a 
eat Ee EE, = —-— -—-— 
OX» OL, c a O21 

OB OAS ip ee a (24.01) 
ar; Sn c at 86 ay 
0A, 8A, _ 164, 
ee eer eae ea a 

The field becomes zero when, and only when, the linear differential form 


is a total differential. We assume that this linear form is invariant or scalar. 
This is so if A,, A, and A, are the spatial (covariant) components of a four- 
dimensional vector whose fourth component is 


A,= —® (24.03) 
Bearing in mind that ct = x we can write S¢ in the form 
3 
So = S'Ay day (24.04) 
1=0 
The condition 
Fr aca, (24.05) 
c at 


usually imposed on the potentials is invariant under Lorentz transformations 
because it can be written as 


3 
a 
> Riel) (24.06) 


or, with the introduction of contravariant components, by 
A°=A,=—®; At = —A,; A’? = —A,; At= —A, 


(24.07) 
=. @At 
as ys ae (24.08) 
Inserting z) = ct and A, = —® into (24.01) we rewrite the expressions for 
the fields in the new notation. For the electric field we get 
pats _ Ay 
Oz, 0X9 
_ 04, AA, ae 
On, Oy al 
yp, 24e Ads 
GL, 


while the expressions for the magnetic field remain unchanged. All the 
equations can be combined in the short form 


Li (24.10) 
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where we have 


H, = Fog BE, = Frio 
H,=F, E,=F. (24.11) 
Sy—> Fi» Ey = Fao 


If (A;) is a covariant vector, then (/',) is an antisymmetric tensor of second 
rank. Thus our assumption as to the vectorial nature of the potentials leads 
to the conclusion that the electromagnetic field is an antisymmetric tensor in 
the four-dimensional space-time manifold. From this it is easy to verify that 
Maxwell’s equations are covariant for Lorentz transformations. We construct 
from derivatives of field components the third rank tensor 


OFi, OF a OF y 


(24.12) 
Cx) Om Oxn 


It is evidently antisymmetric. By (22.09) we can adjoin to it the pseudo-vector 


* * * * 
Fo = —F 23; Ft = Fos; FP? = Foig; Fe = Fy (24.13) 


We calculate the components of this pseudo-vector and find 


Ficus Chae OR OP Oe Og ONS sy 
Oz, OX. 0X5 Ox, OL, Ox 
pn OF,, OF o¢ i OF;, 10H, 08, | OB, 
~ Oxy a dt, | Of, Cc Ot Oa, | Oxy (24.14) 

zn oF oF oF 10H, OE, . OB, , 
fee a De eats ees 

OLy Ox, Ox c at Oz, Ox; 
pp OF ., i? OF 5, OF sy _ 1 oH = oH, ok, 


OX OL» Ox, c ot OL, 02, 
For ease of interpretation we have replaced z, by ct on the right-hand sides. 
By Maxwell’s equations 
oH 


eae = =0; divH=0 (24.15) 
Cc 


so that the right-hand sides of all equations (24.14) are zero. 
Therefore the first set of Maxwell’s equations can be written in the covariant 
form 
Fi=0 (24.16) 
Fix = 9 (24.17) 
We now turn to the second set of Maxwell’s equations, passing first from 
covariant field components to contravariant ones: 
F3—H, F=—E#, 
Fu- TT, F°=—E, (24.18) 
Fu H, F230 — _f, 


or 


We form the contravariant vector 
3. aKik 
t= = 


OXLk 


(24.19) 
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Then 
oF aF? aF?% ok, OE, OE; 
3s =0 4. —  __ SS EE ST 
OX, OL OL OL, OL» OL, 
oF 10 OF12 = aF}3 1oE, oH, 20H, 
st= 2 404+ 2-4 5—-=-- ae 
OL OL» OL c a OL» OL (24.20) 
aF2® oF 0 oF 1eE, 0H, OH, 
= 0X9 a 02, ot ar, c ot Ox; Ox, 
OF? OF: af 0 10k, 0H, 2H, 
ita OL, Ox, CL» 6 at Ox, OX» 


so that the zero component of s* is 

s° = divE (24.21) 
while the spatial components are equal to the components of the three~ 
dimensional vector 


oE 
(s!, s*, 83) = — : ae + curl H (24.22) 


In free space the right-hand sides of these equations are zero and therefore 
the second set of Maxwell’s equations in free space can be stated in the form 


3 1k 
pas 2 (24.23) 


In a space occupied by charge of density p the Maxwell-Lorentz equations 
have the form 

loE 4n 

i E — < 1H _—-=— SS 

div 4p; cur. eae j 


where j is the current density. The covariance of Maxwell’s equations is there- 
fore assured if the quantities 


(24.24) 


4 
s°=4np; t= = je) (6 =1,2,3) (24.25) 


represent the contravariant components of a four-dimensional vector. By 
the identity 


3 
CF 
= 0 24.26 
hi ea 
3 
ast 
we have a0 (24.27) 
Ont 
{=o 
ae) ‘ 
whence a +divj=0 (24.28) 


As is well known, this equation expresses the law of conservation of charge. 

The fact that the quantities (24.25) form a four-dimensional vector is 
perfectly consistent with the physical interpretation given to them by Lorentz. 
According to him j = pv where is the density of the charge and v its velocity. 
The four-dimensional current vector 


PC, PV1, PVa, PVs (24.29) 
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is proportional to the four-dimensional velocity vector 


c VY, 


Vay Tae 


Th 
24.30 
” ‘. (24.30) 


as ya : 3 
SVE = ye Tee 
(see (20.18)) with the coefficient of proportionality 


x p* = J( = “) (24.31) 


This quantity is an invariant which has the physical significance of the charge 
density in that reference frame in. which the charge is momentarily at rest. 

We have verified the covariance of Maxwell’s equations and have established 
that the transformations undergone by the electromagnetic field when going 
from one reference frame to another are those of an antisymmetric tensor of 
the second rank. We shall now state the transformation laws in an explicit 
form, introducing for the general coefficients of the Lorentz transformation 
their values for the case that the transformation does not involve a spatial 
rotation. We then have 


1 
00 AL = VCR) 

a a 
Oy = a no - Ve -@= 12,3) (24.32) 

ViV 

dix = —Biz — (too — 1" (i, B= 1,2, 8) 
Inserting these values into the general expression 
3 

Fix = Cee > ayauk yj (24.33) 


j,l=0 
and using the antisymmetry of Fy, we get 


' V 
Py; = Aol 23 ae (20 poe 1) a7 (VF 23 + VF's, + Valj3) 
1 
es Boo (VeF'39 — VgF a9) (24.34) 


and two similar expressions obtained from (24.34) by cyclic permutation ofthe 
suffixes 1,2 and 3. We also get 


; V 
Fyo = GF yo — (@o0 — 1) = (Vik yo + VoFoo + Val’ go) 
V 
1 
+ 3 Goo (V2F 2 — VF's) (24.35) 


and two expressions obtained by cyclic permutation. Using (24.11) to pass to 
the usual notation for the electric and the magnetic fields and using three- 
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dimensional vector algebra, we may write 


; : V a 
Ey = Agylty — (@o0 — 1) 3 (V °F) aera x Hj, 
(24.36) 
' vy Zoo 
HH, = Gol, — (490 — 1) Pe (V-H) — os [V x E}, 


Passing from the components to the vectors, replacing @, by its value and 
regrouping terms a little, we obtain the final expressions 


Vv 1 V 1 
E’ = — (V-E ——-_~ [E- = (V-E -{[VxH 24.37 
1 V if 
———. ([H — = (V:H)—-[VxE 24.38 
For comparison we shall also state the transformation formulae for a four- 
dimensional covariant vector, using the usual vector notation for its space 
components. 


Bice 


V 
and H! = =, (V-H) + 


1 
Tamra (405 0-4) se 


Vv 1 V 
d A’ =A——(V-:A eae eer ae 
= BY) +e 


We note that the equations for the transformation of the fields give 
E'-V=E-V; H’-V=H-V (24.41) 
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(v at = Aq] (24.40) 


This means that those parts of the electric and the magnetic field which are 
parallel to the velocity V remain unchanged. On the other hand for the spatial 
part of a four-dimensional vector the part perpendicular to the velocity remains 
invariant, for we have 


, v , Vv 
A 5 (V-A) =A — 7, (V-A) (24.42) 
If we consider the particular Lorentz transformation with 
the transformation equations simplify, giving 
E,, = E.; H, = H, 
1 V 1 V 
ES ee ee a 
0 70 ra | a: ); By = Tava (He + B.) 
: 1 V i V 
E SS E, = A > H' ae 2 ae 
Sma (B+ ce) = yey (™ 5 B,) 


(24.44) 


One can construct two combinations from the field components that are 
invariant under Lorentz transformations. We have 


EB? —H’? = EB? — HP (24.45) 
and E’- H’=E-H (24.46) 
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The first of these quantities is also invariant under improper Lorentz trans- 
formations (see Section 22) and is a scalar. The second changes sign in improper 
transformations, and therefore is a pseudo-scalar. This can easily be seen in 
tensor notation. We have 


l 3 
EB? —He= — = tk 
5 » FuF (24.47) 
t,k=0 
1 * 
se tk 
and j > Fi,F (24.48) 


a 
where F?* is an antisymmetric pseudo-tensor related to the field tensor by 
equations analogous to (22.06) and (22.07). We note that for a plane wave 
the quantities (24.45) and (24.46) vanish (in all frames). 

As do Maxwell’s equations themselves, the transformation equations for the 
field reveal a very close relation between the electric and the magnetic field. 
For instance, a field due to charges at rest in some frame will be an electrostatic 
one in that frame ; but in another reference frame which moves relative to the 
first, a magnetic field is found in addition to the electric one. This is readily 
understood since a moving charge represents a current. Similarly a field that is 
purely magnetic in one frame will appear in another frame as a superposition 
of a magnetic and an electric field. 


25. The Motion of a Charged Mass-Point in a given External Field 


We now consider the motion of a particle of mass m and charge e in a given 
external field. As we already saw (Eq. (20.23)) the four-dimensional vector 
of acceleration has the components 


spies 1 d ( c ) ; 
Vl — v8) dt \7 — 0%) ° 
1 d 1 dx, 
V (1 — v3/c®) de (Jaa di 
The quantities here being differentiated with respect to time are the com- 
ponents of four-dimensional velocity 


G =1,2,3) (25.01) 


we = 


1 dx; ; 
se! Va — v{c%)’ it 2 aa (i = 1,2,3) (25.02) 
ee ee 


We introduce a primed reference frame such that in it the instantaneous 
velocity of the particle is zero. We then make the assumption that in this 
frame, which is instantaneously travelling with the particle, the acceleration 
as usually defined is proportional to the electric field 
d’r,_ ¢ E’ 
dt’? m 
This equation must now be stated in the original frame. 
In the primed frame the four-dimensional velocity and acceleration are 


w=c, whaw?=u?=0 (25.05) 


(25.04) 
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d2z't 

dt’? 

Thus in the primed frame the relation between the four-dimensional accelera- 
tion and the field is 


and w°=0, wis (25.06) 


w=0, wie < Bi (25.07) 


Using the transformation rules for a contravariant vector (which are the same 


as the transformation rules for the coordinates) we find that 
3 


wi = > exanrw’® (25.08) 
k=0 


Inserting into the right-hand side the values for w’* from (25.07) we get for the 
four-dimensional acceleration in the original system 


wt = — = (ay By + OygBy +5 8s) (25.09) 

The values of the aj must be inserted into this from (24.32), giving 
ee 2 0y. z (25.10) 
and ‘phe <(B; oe ee ne ‘(V-E ») (25.11) 


Finally E’ must be expressed in terms of E and H. Since V-E’=V-E, 
equation (24.36), with ao, replaced by its explicit value, gives 


o_ & 1 ° 
wo? = = ve TET (V -E) (25.12) 
ane wt = £ a aa (F +: six H}), (25.13) 


Here V is the velocity of the frame moving with the particle ; at the instant 
under consideration V coincides with v, the velocity of the particle. There- 
fore we can rewrite the previous equations as 


0 _ © 1 . 
; e€ c 1 


Now we can abandon the moving frame and take it that the equations just 
obtained are valid at any instant of time. They are thus the equations of motion 
of the particle. However, the form of the equations (25.14) and (25.15) is 
inconvenient owing to the mixed notation used ; on the left we have a four- 
dimensional vector while the right-hand side involves three-dimensional 
quantities. We now make the notation uniform, by first stating the equation 
entirely in four-dimensional and then entirely in three-dimensional terms. 
Using the equation (25.02) for the four-dimensional velocity and equation 
(24.11) for the field, we can write equations (25.14) and (25.15) in the form 


w? = — (WF 9 + u2F , + uF,,) (25.16) 
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€ 
and wt = male E10 tuk, +P, + uF.) (¢ = 1, 2, 3) (25.17) 
Because of the antisymmetry of the tensor Fy the right-hand sides of all 


equations (25.17) contain in fact three and not four terms. Passing to the 
covariant vector of acceleration 


Wy = w®; Ww, = —w}; Wy = —w?; Ws = —w3 (25.18) 
we have for all four values of 7 
3 
e@ 
en ee Tk, k 
wi — Zz ukF iy (25.19) 


On either side of this equation we have a covariant vector so that it is obvious 
that the equations of motion, written in this form, are valid in any reference 
frame, as it should be. It is easy to verify that 


3 
> utw;, = 0 (25.20) 


i=0 
in agreement with (20.24). This means that the first equation of motion is a 
consequence of the three others. 

In view of the covariance of equations (25.19) it would have been sufficient 
for proof of their validity, to verify that in the primed frame they reduce to 
the form (25.07). 

We now pass to the three-dimensional statement of the equations. Using 
the expression (25.01) for the four-dimensional vector of acceleration and 
introducing three-dimensional vector notation we have 


d 2 

5 (saa) eee 5, (25.21) 

d mv 1 " 
and 7 (sas) = o( + ; [v x H)) (25.22) 


(In (25.21) the right-hand side is the power, or rate of work done by the field 
yn the particle ; by general mechanical principles one should therefore interpret 
the left-hand side as the rate of increase of the kinetic energy of the particle. 
{In (25.22) the right-hand side is the Lorentz force on a moving charge and the 
eft-hand side the rate of incrcase of particle momentum. 

Thus the kinetic energy and the momentum of the particle can differ only 
»y constants from the expressions 


me? - 
W= Va ve) (25.23) 
mv ee 
nd P= Va — ee) (25.24) 


rhose time derivatives occur in the equations of motion. 

The values of the constants can be determined by a covariance requirement ; 
ne must postulate that the encrgy and the momentum of a particlet form a 
ur-dimensional vector. 

Now, the quantities (25.23) and (25.24) are themselves a four vector, for 


+ More aeeurately the energy divided by ¢ and the momentum. 
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they are respectively proportional to the temporal and spatial components of 
the four-dimensional velocity. Therefore the constants must vanish.{ Con- 
sequently the quantities (25.23) and (25.24) are the energy and momentum 
of the particle. 

If we take v = 0 then, in agreement with older ideas, P = 0, the momentum 
of a body at rest is zero ; but for v = 0 the energy W takes on a value different 


from zero, namely 

W = ine? (25.25) 
This result disagrees with older notions, but is fully confirmed by experiment : 
any body of mass m represents a store of energy W, (see Section 34, below). 
Asimilar relation 


W = Me? (25.26) 


between the mass and the energy of a body holds also at arbitrary velocity 
if M is understood to mean the quantity 
m 


M= Wa — oe) (25.27) 
which is itself a function of the speed. The quantity M must be taken as the 
rational generalization of the concept of mass. It enters as the multiplier of 
the velocity in expression (25.24) for the momentum and so describes the inertia, 
or inertial mass, of the body. M is usually called simply the mass while m 
is termed the rest-mass. Correspondingly W is referred to as energy and W, 
as rest energy. 

The rest-mass m is a constant independent of the state of motion of the 
particle as a whole, which may, however, depend on the internal state of the 
particle, if the latter has some complex structure and some internal degrees 
of freedom. The rest-mass can be expressed in terms of the invariant of the 
four-dimensional vector of energy-momentum. Indeed, this invariant is 
independent of velocity and equal to 

2 

eC =P Ste” (25.28) 

The mass M depends on velocity in such a way that if the particle approaches 
the speed of light its mass increases without limit. Consequently no body 
having a rest-mass different from zero can attain the speed of light in any 
frame of reference. This fact confirms strikingly the limiting property of the 
speed of light. 

If the speed of a particle is small compared to the speed of light the difference 


4 


1 30 u 
W — me? = - mv? +—-m— +... 25. 
me 5 mw color + (25.29) 


will only differ by small quantities from the usual expression for the kinetic 
energy. 
The foregoing relations were derived for the equations of motion of a charged 
{ A more detailed argument goes as follows: the constants must on the one hand form a 
vector, on the other they must he independent of the state of motion and, hence, of the frame 
of reference. However, a constant vector dificrent from zero, with componcnts independent 
of the frame, does not exist. 


The Theory of Relativity in Tensor Form 17 


particle but they have more general validity. The relation (25.26) between 
mass and energy holds not only for the present case of a particle of invariable 
test-mass, but also for any complex body or system of bodies in which internal 
processes can effect changes of rest-mass. The relation expresses the funda- 
mental law of proportionality of mass and energy. We shall return to a dis- 
cussion of it in Section 34. 


26. Approximate Description of a System of Moving Point Charges 

The problem of the motion of a system of charged mass points requires a 
simultaneous determination of the motions of the charges and of the electro- 
magnetic field in which they move; only an external field can be considered 
as predetermined. The ficld transmits the interactions between the charges 
with a finite velocity, namely c. Consequently the force acting on a given 
charge will depend not on the momentary state but rather on the past state 
of motion of all the other charges. The field arising in the accelerated motion 
of charges not only carries the interaction but also radiates into outside space, 
therefore the energy of the charges will be spent in part on radiation and the 
systein of charges is not a conservative one. Also it must be borne in mind 
that the field possesses an infinite number of degrees of freedom and therefore 
the system of charges with thcir ficld is strictly speaking one with infinite 
degrees of freedoin and thus not a purely mechanical one. 

Nevertheless the problem of the motion of charges can be approximately 
formulated in mechanical terms, involving only the degrees ‘of freedom of the 
charges themselves. To do this one must express the field, or rather the 
potentials, of each charge in terms of its position and velocity and, further, 
one must find an approximate way of relating quantities at an earlier to 
quantities at a later instant. The potentials at a given place and given time 
are in this way expressed approximately in terms of the positions and velocities 
of the particles at the same instead of at an earlier time. An approximate 
Lagrangian for a system of charged particles may thus be constructed. 

We find first the Lagrangian forin of the equations of motion of a single 
charged particle. It is easy to verify that equations (25.22) can be derived 


from the Lagrangian 
Toe: met | -") —eb + - (A-v) (26.01) 
c 


where ® = —A, and A, are the scalar and vector potentials respectively. For, 
since £4 = Vv; we have 


oL NV: 4 e€ 
Oats a lve) ce * 

Ve ae 
ol op é OA, 
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Inserting these expressions into Lagrange’s equations 
(3) alee, (26.03) 
t 


0x1 
‘ and remembering how the fields are expressed in terms of the potentials by 
(24.01) we get 


OF 


ad MU 1 

ia) — o( Bi +- aly x H}) = 0 (26.04) 
which is the same as equation (25.22) and of which we also know (25.21) to be 
@ consequence. 

We now pass to the case of a system of particles. We seek an approximate 
expression for the fields of particles of known motion. Considering first a 
spatial distribution of charge we can write down the equations for the po- 
tentials as 


1 a® 

1 @A 4n , 
and AA — oo = i (26,06) 
The solution of (26.05) that is required is the retarded potential 

© = [a lela (26.07) 
FFI 
/ 1 / 

where [e]=elr', ); t= t—s |r —r’| (26.08) 


If the motion of the particle is sufficiently slow and sufficiently smoothf then 
for not too great distances the expression for [¢] may be replaced by the first 
few terms in an expansion in inverse powers of c 


| 8, 1 a 
[e] = elf’, t) —- [pF —F’] 5 elt’ 1) tSalho =| Bp OF ) + - 
(26.09) 
Inserting this expansion into (26.07) and using the fact that the integral 
| o(r’, t) dV’ = eg (26.10) 
is the charge of the particle and therefore independent of time, we get 
o(r’, t) a 1 d? ; ; ; 
® = [ie + age | OFF av +... (26.11) 
If we now assume Le - charge is concentrated near the pointt 
r =f, =1Fa(t) (26.12) 
we obtain from (26.11) ; 
€a €q @ 
OD — 
|r —ra| + a8 Qc Bet Pom ete) 


+ These requirements impose a limitation on the speed of the particle (v?<c®), on its 
acceleration and on higher time derivatives. 


¢ The suffix a in eg, rg, vg, etc., now denotes the number of the particle. 
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Putting Ta(t) = va (26.14) 
and performing one of the time differentiations in (26.13) we have 
Ca Ca ra) Va'(r —— 2 
is 2 ee 
|r —Ta| Qc? at ( |r —Ta| eel) 


Here the first term gives the Coulomb potential, as for a charge at rest, while 
the second is a retardation correction. 

In the expression for the vector potential we need not include any correction 
terms§ and we may write it as 

Ap ea (26.16) 
e|r — ra 
(This expression for the vector potential is obtained if in (26.06) second deri- 
vatives with respect to time are neglected, if the current density j is replaced 
by eVq and if the transition to a concentrated charge is made.) 

The form (26.13) of the scalar potential is inconvenient because it involves 
not only the velocity va but also the acceleration Vg of the particle which 
produces the field. This inconvenience can easily be removed by using the fact 
that the fields are not changed by a “ gauge transformation ” of the potentials, 
ie. by the substitution 


1] 
® +> +.3 
3 a (26.17) 
Ay —> Ag —_— 
0x4 


where x is an arbitrary function of space and time coordinates. We choose 

__ ¢aVa -(r —fq) 

490 |e ral 

and thereby cancel the second term in (26.15) at the expense of complicating 
the expression for A; a little. The result is 


(26.18) 


(26.19) 


and 


A —% ( Pat yy —~ Xax)(a4 — =) (26.20) 


~ Be r—fq| = |r —ra|* 


It is easy to verify that the new expression for the vector potential has vanishing 
divergence : 


S OAg _ (26.21) 


Insertion of these values for the potentials into the last two terms of the 
Lagrangian (26.01) gives 


e ea eéa [ (V°Va) {Va'(r —TYa)}{v-(r —Ta)} 
SMES) ge as pee eon 


(26.22) 


§ In the Lagrangian and in the equations of motion the terms in the vector potential are 
of the same order as the terms involving corrections to the scalar potential. 
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This expression is symmetric in the particle producing the field and the particle 
being acted on. We can consider it to be an approximate statement of the 
interaction law for two particles. It allows us to write the Lagrangian of a 
system of particles as 


) 
en > mat /(1 - 0 -52. i 7 
a—Tp 


Gen 
1 (rarvo) (area — eho‘ (ee — #0) 
ary ee eo EE 26.23 
re a a [ra —ro|* ( ) 
(a#b) 


Incidentally, since quantities of order v?/c? are assumed small in the interaction 
law it will be more consistent to neglect higher terms in this quantity every- 
where. Therefore taking only the first terms in the expansion of the square 
root we can replace (26.23) by 


4) 1 
b= —Wot D (Gm +93) 3D fe oy 
a+b 


iz ’ yaa ( ivatVo)e. (hers Soli eh) (26.24) 


Ta —To| [ta —To|* 


where 
Wg Se iy (26.25) 


is the sum of the rest energies of all the particles. 

We now consider the question of Lorentz transformations in the problem 
of the motion of a system of particles. Assume the problem to be solved in 
some frame of reference. This means that in the given frame the spatial co- 
ordinates of each particle are known as functions of the time in the frame: 


We now perform a Lorentz transformation and seek to express the new space 


coordinates z,, y,, 2, a8 functions of the new tiine t’. The equations of the 
Lorentz transformation have the form 


, 1 
ro=r—Vi+ (sa \t Pa {V-(r — Vi)} (26.27) 


and be de Oe ee ( : Vv 

= Tava |b aa r) (26.28) 
Therefore the new functions of the new time can be obtained by eliminating 
the variable ¢ from the equations 


; 1 v 
Ta =T,(t) — Vt + La pais = 1) vi (V -{ra(t) —_ Vi}) (26.29) 


pie 1 1 
and C= Van Pye) ( ee {V ra(d)}) (26.30) 


In other words, the solution of (26.30) for ¢ must be found and inserted in 
(26.29). This can only be done approximately taking V2/c? to be small com- 
pared with unity. The approximate solution of (26.30) is 
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=— |] ue U 1 , ‘ 
Before inserting this into (26.29) we write the latter in the simplified form 
j é v 
where r, =1,(t) — Vt + or ({V -r,(¢)} — Vt) (26.32) 


Inserting now, we get approximately 
1 44f , r 1 , 1 , Vv 
rl’) =r,(v’) — Ve + x v(t’): (- 5 Vy + V4) ors {V-r,(t’)} 
(26.33) 
(26.34) 


Volt’) = (Fr) 


Differentiating (26.33) with respect to t’ we obtain the corresponding expression 
for the velocity 


1 
Vall) = Vylt) -V + 3velt) (—5 78 + V-¥Q(09) 
Mi V- : 1 le 1 ry : 26.35 
— aa WV val) + S¥elty(—5 V+ (Wena) 28.35) 


where Vg is the derivative of v2 with respect to its argument. 

We note that the relation between ¢ and ¢’ depends on the number a of the 
particle ; this is understandable because the transition to the new definition 
of simultaneity requires the introduction of different corrections for different 
particles. The old definition of simultaneity used in the original frame involved 
taking the same value of ¢ for all particles (so that the t’ values were different), 
but with the definition of simultaneity used in the new frame the values of ¢’ 
for all particles are taken equal, so that the ¢ are different. We can there- 
fore omit the suffix a on t’. We can also replace t’ by the unprimed symbol ¢ 
and omit this argument from all functions in (26.33) and (26.35). We can then 
write these equations more briefly as 


‘ ] 1 Vv 
r=", SMCS aa (57% +0-1))- vr.) (26.36) 


and 


rs 1 oe 
Nip Nar ae ate +(V-¥9)) = 55 V-v.) 


1, 1 
+ oe Va (- 3 V% + (V 1,)) (26.37) 


This is the form of the new functions r, and v, expressed as functions of the 
new independent variable which, however, we have denoted by the same 
symbol as the old one. 

The equations derived are approximate but on the assumption that the 
speeds V and v, are of the same order the terms omitted are of the same order 
as those that are inevitably neglected in the construction of the retarded 
Lagrangian. Thus the omissions are in conformity with the nature of the 
sroblem. 
~ It is well known that equations of motion are obtained by varying the action 
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integral 
(2) 
S= | Ldt (26.38) 
(a) 
and that in the usual case that the Lagrangian depends only on the coordinates 
and the velocities, and possibly the time, the equations of motion are 


éL 
ees (26.39) 
dt O%q  O2,¢ 
If the accelerations also enter the Lagrangian the equations of motion take 


on the form 
a@oL d oL aL 
dt? Oia 7 dt 0iq  OXq 
where %, _ and Z, are the components of acceleration of the a-th particle. 
If the action integral S is invariant with respect to a Lorentz transformation 
the equations of motion will obviously be covariant. 
Let us consider, for example, the equation of motion of a single particle, as 
derived from the Lagrangian (26.01). In this case the action integral can be 
written as 


3 
S= —me | dt +S] > dzx (26.41) 


(with x)» = ct). Its invariance follows from the invariance of the differential 
of proper time and of the differential form (24.02) in the potentials. 
However, the invariance of the action integral is not a necessary condition 
for the invariance of the equations of motion ; it is sufficient that a Lorentz 
transformation should leave the variation of the action integral unchanged. 
This can be seen for example in (26.41) if one subjects the potentials to a gauge 
transformation of the type (26.17) with some function y. Then S changes into 


Gees - (2) — yy) (26.42) 


where x) and y‘*) are the values of y at the two limits of the integration. 
Since the variations vanish at the limits, we have 

dS’ = 58 (26.43) 
and the equations of motion do not change, as it should be, since a gauge 
transformation of the potentials does not affect the fields. 

We apply these considerations to our case of the motion of interacting 
charges and to the Lagrangian (26.23). We denote by L’(t’) the expression 
obtained for the Lagrangian L(t) by replacing ra(t) by rj(¢’) and va(t) by v;,(¢’) 
from (26.33) and (26.35). If the original action integral was 

(2) 
S=|Ll(t)dt — (26.44) 


the transformed integral will be 


s= i L(t’) dt’ (26.45) 
(i) 
and the change in the action integral due to the Lorentz transformation is 
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Ss —S= | Tia = | L(t) dt (26.46) 
a) e) 
or, if we call the integration variable ¢ in both integrals 
3 (2) (2) : 
So -—S= i L(t) dt — | L(t) dt (26.47) 


q) qQ) 
Here L’(t) is the expression obtained from L(t) by replacing ra by r, and Vg 
by v, according to (26.36) and (26.37). 
It follows from (26.47) that in order to ensure the covariance of the equations 
of motion it is sufficient that the difference 


{L'(t) — L(t)} dt = dF (26.48) 
is a total differential of some function F. For then 
S—S = Fe — Fo (26.49) 


where F‘?) and F) are the values of the function at the limits, so that the 
variations of the two integrals will be equal. 

By performing the calculation we can verify that (26.48) really holds with the 
function F given by 


V2 
si >(- Mek g yma — 35 tv -ro) — 34) 


(b#a) 


1 V2 V4 
-5 Z ma(V-ta)(V Va) + 35 2 ma(V ta) ++ a 2 mat (26.50) 


This also proves that the equations of motion derived from the Lagrangian 
(26.23) or (26.24) are indeed Lorentz covariant to the approximation stated. 


27. Derivation of the Conservation Laws in the Mechanics of Point 
Systems 
We now turn to the derivation of integrals of the equations of motion. To 
this end we consider the infinitesimal Lorentz transformations studied in Section 
23 in the form involving all ten parameters. We recall that a transformation 
with these parameters expressed the following: a change + of time origin, a 
displacement a of the origin of the spatial coordinates, a transition to a new 
reference frame moving with velocity V relative to the original frame and an 
infinitesimal rotation of the spatiai axes characterized by the quantity w. 
We have to calculate the change, resulting from such transformations, in the 
functions of time that represent the coordinates of each particle. This variation 
dry has two causes, the first arising from the vectorial nature of rp and the 
second from the change in the argument t. 
According to (23.06) and (23.17) the former variation is 
Ar, =a —Vi+[w x rp] (27.01) 
The argument ¢ appearing in quantities that refer to the b-th particle is varied by 


1 
Aty ="t — —(V -rp) (27.02) 
c 


where the first term_comes from the change in the time origin and the second 
‘rom the transition to the moving frame. (Equation (27.02) is one of the 
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equations of (23.06)). 
The variation A*r, due to the change of the argument of rj(t) is obtained 


from the equation 


r(Q=r(')  ( =t +A4,) (27.03) 

whence r(t) =r,(t — At,) =4,(t) — v,At, (27.04) 

and, therefore, A*r, =1,,(t) —1,(t) = —v,A4, (27.05) 
Thus the total variation in the form of the function rg(t) is 

dry = Ary + A*ry = Ary — vp Ato (27.06) 


Using (27.01) and (27.02) the total variation d5rp) can now be expressed in terms 
of the parameters of the infinitesimal Lorentz transformation as follows: 


1 a 
ory = —Vet+a—VWit+ a vo(V-ry) +[w X ro] (27.07) 
and evidently the total variation dvp of the velocity vp will be 
d 
= — 27. 
8vp 7 Srp (27.08) 


In the expression for 5r, the terms containing V are the same as the terms of 
first order in V in equation (26.36), as was to be expected. 

We now calculate by two methods the variation of the Lagrangian in an 
infinitesimal Lorentz transformation. The first method is based on the use of 
the equations of motion, the second on direct transformation and the use of 
(26.48) and (26.50). We have 


Shs > (=. Bv— + “-Ba) (27.09) 


since the Lagrangian does not aNeulye time ee 
Here @L/ér, is to be understood to mean the three-dimensional vector with the 
components OL/ 0%, OL/éy_ and @L/ Oz, and @L/éva is defined correspondingly. 
Using the equations of motion and also the relation (27.08) we can write 


a 
SL = 5 a Sra (27.10) 


On the other hand, as mentioned batore; we can evaluate the variation in the 
Lagrangian Z without using the equations of motion. Evidently L does not 
vary when the spatial origin is displaced or when the spatial axes are rotated. 
If the zero of time is changed L varies by —+-dL/dt for if L’(t’) = L(t) where 
t’ =t++, then 


L(t) = L(t — +t) = L(t) —t— (27.11) 


The change in L as a result of passing to a moving frame is given by (26.48) 
where it is sufficient to take for F the terms of (26.50) which are of first order 
in V, namely 


F= > jag Mao eo irresrr Me Tq) (27.12) 


(a#b) 


dL dF ad 
us SL at a ge (27.13) 
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Equating (27.10) and (27.13) we get 


d ~ OL 
oL 
whence — -drg+ Lr — F = const. (27.15) 
- OVa 


The left-hand side of this expression is a linear function of the ten parameters 


of the Lorentz transformation. Inserting into (27.15) the expressions (27.07) 
for $rq and (27.12) for F we can put 


f=-<W+a-P+V-K+w-M = const. (27.16) 
where W, P, K and M do not depend on the parameters and are given by 
~~ éL 
W = ‘—— 
Z Vee A (27.17) 
aL 
P= paras 
> (27.18) 
v ea eb 1 oL oL 
K ee n a t ee: 
ys la Ma — + set D, [rar 2 =) a Bi 
(b#a) (27.19) 
éL 
M = — 
E x *| (27.20) 


The left-hand side of (27.16) must be constant whatever the values of the 
parameters t, a, V and w. This can only be if the quantities, W, P, K and 
M are themselves constant. Thus we have obtained ten integrals each of which 
is connected with a particular one of the ten parameters of the infinitesimal 
Lorentz transformation. The physical meaning of these integrals is easy to 
see. W is the energy, P the momentum, K the integral of mass centre motion 
and M the angular momentum. Thus the ten integrals we have found are the 
classic integrals of the theory of systems of mass points, but with corrections 
for Relativity. 
Let us write down the integrals explicitly. We have 


i 3 a €ald 
Wine DGG De ie a \ra —Fo| 


. 1 
fe =) eae o(Va‘ Vd) Beet > ge ees, {Va‘(fa —To)}{Vo" (Fa —F)} 
4c? \to —Fo| dc? Ly |ra —Fp 
a,b a, b (27.21) 
axb (a#b) 


This is the energy of the system. 
If we put 
W=c? Dm +E (27.22) 
a 


the quantity H will be the energy as it is usually defined, namely, so as to 
vanish for zero velocities and infinite distances between particles. Side by side 
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with the energy we shall consider the corresponding mass 


E 
M — W —- Ma + — (27.23) 
c c 


which may be called the total mass of the system. By the definition just given 
the total mass of a system is equal to the sum of the rest-masses of all the 
separate particles and of the mass corresponding to their kinetic and inter- 


action energies. 
Let us now discuss the other integrals of motion. By (27.18) the integral of 


momentum is 
v4 Caled (fa —Po){Vo° (Fa — a) 
P= > mara +54) + aD [re —T 5] (v. CU — Ae aes 
a (27.24) 


To write the integral of the mass centre motion in an explicit form we introduce 
for brevity 


ve é. ~~ e 
Mt =m (14+ —% é fines aot 27.25 
* ma a a) a 22 2 Ira —ro| ( =) 
(a#b) 
Then we have 
K = ¥ M*r, —1P (27.26) 
a 
It should be noted that 
SI Sal (27.27) 
a 


where M is understood to mean the expression (27.23) taken to the same 
approximation as that in which M* is given: this corresponds to the non- 
relativistic approximation for the energy LE. 

Introducing the position vector R of the mass centre of the system by the 
relation 


MR = > Mtr, (27.28) 
a 
expression (27.26) for K takes on the form 
K = MR— (P (27.29) 


the fact that K is constant represents the law of motion of the mass centre. 
Finally, the integrals of the angular momentum of the system are 


M = y2 ma(1 + 5 + st,)ire x Val 


+a ea | (ir fee 2) (27.30) 


oF Fe — Fol? 
Gan 


28. The Tensor Character of the Integrals of Motion 


We must now investigate the behaviour of the integrals of motion under 
Lorentz transformations. If we knew from the start that 
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I= —1W+aP4+V-K+o-M (28.01) 
was an invariant we could decide the tensor character of W, P, K and M from 
the properties of the parameters t, a, V and w, of the infinitesimal Lorentz 
transformation. Since, however, we have not proved the invariance of (28.01) 
we shall adopt a more direct approach, though it involves rather tedious 
calculations which we shall refrain from elaborating. We shall subject W, 
P, K andM to a finite Lorentz transformation and see how they can be expressed 
in terms of the initial quantities. 

We shall denote by W’, P’, K’ and M’ the quantities resulting from putting 
r, and v;, in place of rz and vg, with the use of (26.36) and (26.37). 
We obtain 


1v2 3784 1v? 
Vv 
an pap v4 ¥fy.(p_vM) 08.09) 
c 2c? c? 


or, in terms of the total mass, 


1 3 V4 1 V? . 
eM’ — («: we 5 ve+ sa)M ae (: Ae 5a) VP) (28.04) 
Vv 
and P’=P—VM + 508 {V-(P — VM)} (28.05) 


To the same approximation these equations can also be written as 
eM’ = eM — (V-P) 
V(1 — V?/c?) 


V 
1) pifV(P—VM)} (28.07) 


(28.06) 


1 
and pap —v + (pay - 
Comparing these equations with the usual ones for Lorentz transformations 
(26.27) and (26.28), it is apparent that the components Pz, Py and P; of the 
vector P transform like the coordinates x, y and z, and the total mass like the 
time ¢t. This means that the set of quantities 


pkgoe. pip. PSP PSPs. .Qs08) 
Cc 


is a contravariant vector. The covariant components of this four-dimensional 
vector are 
W ‘ 
Po = Me=—; Py=—Ps; Pea=—Py; Ps=—Pz (28.09) 
c 
If we consider the whole system of charges as one compound body, we must 
ascribe to this body the mass J/ and the momentum P. We can also ascribe 
to it, or more exactly to its mass centre, the velocity 


P 
=-- 28.10 
"~M ee, 
P2 
and the rest-mass = Wits = =) (28.11) 


The quantity p is equal to M’ in that frame of reference in which P’ = 0. It 
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is evident from this example that the rest-mass of a body depends on its internal 
state, in the present example on the state of motion of the constituent particles. 

We now turn to the transformation law for K and M. Making the same 
insertions from (26.36) and (26.37) we get for K’ and M’ 


V2 } 1 
k= (1+ 55) K-00) — SIV x M] (28.12) 
v2 1 2 
and = M’= (: oi i) M — =, V(V-M) + (1 + a x K] (28.13) 
To the same approximation we can also put 
K’ _~* w-x) ie ta (x ee 2 tee x My) 
V? /(1 — V2/c?) ae c? 
(28.14) 
_ ov 1 Vv , 
and M’ = 55 (VM) + Taam (m 7a (VM) + LV x Ki} 
(28.15) 


We compare these relations with those describing the transformation of an 
antisymmetric tensor, bearing in mind that in the three-dimensional sense K 
is a polar vector and M an axial one. The required transformation equations 
have been stated in Section 24 for the case of the electromagnetic field tensor 
(equations (24.37) and (24.38)). To make the two sets of equations coincide 
we must take it that the vector M transforms like H and the vector K like 
—E/c. The other possibility M ~ E, K ~ H/e is excluded because K and E 
are polar and M and H axial vectors. We can therefore introduce an anti- 
symmetric tensor in terms of its covariant components 


Ay, = Mz > Ms, = My 3 ar = Mz, 


: 7 (28.16) 
My) = —chz; M5 = —cKhys My = ch, 
or equally well in terms of its contravariant components 
MPS Mg AL?) == AM, ; Me? = M 
. : (28.17) 


fd oh s M»® = cKy; AT 29-= eK 
A check on the correctness of this is the observation that for a single particle 


My, = m(x?u3 — xu?) ete. 
(28.18) 
cK, = m(atu® — 7u), ete. 
where w®, w!, w? and wu? denote the components of the four-dimensional velocity 
and where, to stress their contravariance, space and time coordinates have 
been written with superfixes : 2° = cl, at =a, x? = y and 2 =z. 

We have thus shown that of the ten integrals of motion four, namely energy 
and momentum, form a four-dimensional vector and the remaining six, the 
integrals of mass centre motion and the angular momentum, an antisymmetric 
tensor. This allows one to assert that if these quantities are constant in any 
one fraine of reference they will be constant im all frames. The circumstance 
that we deduced the conservation laws from approximate equations is related 
to the approximate character of the statement of the whole problem. We 
have already remarked that strictly speaking the energy and the momentum 
of a system of charges do not remain constant but may be dissipated as radiation. 
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Having established the tensor character of the integrals of motion we can 
verify that the expression (28.01) is really invariant. To do this it is sufficient 
to rewrite the expression for J in four-dimensional notation. According to 
Section 23 we have 

tT 
De Oo; az = —d,; Gy = —a,; az = —a, (28.19) 


where do, @,, @, and a, form a covariant vector. 
Further 
Viz = CW493 Vy = CWo9; Vz = CWa9 (28.20) 

and Wz = Wy33 Oy = O31 5 Wz = Wy2 (28.21) 
where the wz are the covariant components of an antisymmetric tensor. 
Using the four-dimensional notation (28.08) and (28.17) for the integrals of 
motion we get in place of (28.01) 
I = —a,P° — a,P! — a,P? — a,P3 

+ Oy oL}? + WogM? + wy? + w.,M + w,M% +, (28.22) 


or, more briefly, 


3 
I= > a,Pt + PD win Mtk (28.23) 
4=0 1,k=0 

proving our assertion that J is invariant. 


29. A Remark on the Conventional Formulation of the Conservation 
_Laws 

In this section we make some remarks of a critical nature on the conventional 

formulation of the conservation laws. For definiteness we shall deal with the 

conservation laws for energy and momentum as they are the ones most fre- 


quently discussed. 
One writes expressions for the energy, or the total mass, and for the momen- 
tum in the form 
Mal? 
= = 29.01 
W = Me? Py ee Tl (29.01) 


1 — v2 fe?) 


and 
MmavVa 
P= 29.02 
> Jaa a/(1 — v2 Je?) ( 


and makes two statements about them: first that these quantities form a 
four-dimensional vector and, second, that they remain constant. 

In fact both statements are only correct in the case that the particles do 
not intcract. Then, however, the velocity of each particle separately remains 
constant and each quartet of quantities 

(a) — i (iit Se EO 29.03 
We = Time PO ayes 


is a constant four-dimensional vector. Thus, in the absence of interaction the 
constancy of the sums (29.01) and (29.02) follows trivially from the constancy 
of their separate terms (29.03) ; the summation tells one nothing new. 
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On the other hand, when interactions are present the expressions (29.01) 
and (29.02) do not remain constant and do not form a four-dimensional vector. 
The first part of this statement is easy to understand, for even in non-relativistic 
theory it is not the kinetic energy but the total energy of a system of particles 
that remains constant. 

The Theory of Relativity introduces corrections that take care of inter- 
actions not only into the expression for the energy but also into the one for 
the momentum of the system (see (27.21) and (27.24)). Only with the inclusion 
of these corrections will the two expressions be constant. As for vectorial 
properties, the following state of affairs might at first sight seem paradoxical. 
We saw in Section 25 (equations (25.23) and (25.24)) that the set of quantities 
(29.03) is, for a single particle, a four-dimensional vector even for accelerated 
motion ; further, it would seem that a sum of vectors must also be a vector. 
In spite of this we assert that the sums (29.01) and (29.02) do not represent 
the components of a four-dimensional vector. The paradox is removed by the 
consideration that the quantities W'@) and P are functions of a time common 
to all particles. Their sums W and P are in fact sums of the values of W‘ 
and P\@) simultaneous in the given frame. In a different frame of reference these 
values will no longer be simultaneous ones. Therefore, when passing to a new 
frame one has not only to form linear combinations of the W‘) and P‘@ 
according to the vector rule but must also recalculate these quantities appropri- 
ately to the new simultaneity, as was elaborated in Section 26. Such a re- 
calculation changes nothing only in the case of constant W‘), P‘@) ; the general 
case involves the change of these quantities during the time that corresponds 
to the change from old to new simultaneity, aS was shown approximately in 
Section 26. It is therefore clear that the sums W = > W™@ and P = > P‘@) 


a 
will have vectorial properties only in the case of non-interacting particles and 
not in general. 

In the presence of interactions the quantities that- are vectors and also 
constant are, to the approximation considered, the integrals of motion derived 
in Section 27. Those expressions differ from (29.01) and (29.02) on the one hand 
by their inclusion of interaction terms, on the other hand by the fact that the 
quantities m/+/(1 — v2/c?) are replaced by the leading terms of the expansions 
of these quantities in powers of v2/c*, the expansions being taken to the same 
order as in the interaction terms. 

It may, however, happen that with all particles a large distance apart, the 
interaction terms contribute negligibly but that at the same time the speeds 
of the particles are very great. In such a limiting case of weakly interacting 
particles the expressions (29.01) and (29.02) can be used but it must be borne 
in mind that they are applicable only before the beginning and after the end of 
the interaction, and not in the intervening time. 

In the limiting case stated one can use conservation laws for energy and 
momentum in the usual form 


Mac? Mal? 
>. Fa a Fi ae = Dl Fi (29.04) 


MaVa : Mave 


and 2. a= ae Ta 27 oe (29.05) 
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where vq and v% respectively are the values of the velocity of the a-th particle 
before and after the interaction. It is clear from what has been said that the 
left-hand sides of these expressions do not remain constant during the whole 
time of interaction, but that after the interaction they regain the values they 
had in the beginning. ; 

The remarks made in this section are equally true in relation to the con- 
ventional formulation of the conservation law for angular momentum, in which 
also interaction terms are not considered. 


30. The Vector of Energy Current (Umov’s Vector). 


Let us consider the familiar non-relativistic equations of motion in the 
mechanics of continuous media.. They have the form 


dv 2 Opt 
eae 1 > er 
SF Fila ok es Bi (c = 1, 2, 3) (30.01) 
ap (p04) 
and an —0 ; 
aL 2. ee (30.02) 


Here v,, v, and v, are the components of the velocities of the particles con- 
stituting the medium, ¢ is the density, p, the stress tensor} and F; the com- 
ponents of the external force acting on a unit of mass. The accelerations 
dv/dt appearing on the left-hand side of (30.01) are the so-called substantial 
derivatives, given by 


dv, mw 2. 

= = = eS > a (30.03) 
k= 

The equation (30.02) is usually called the equation of continuity. It is well 

known that it expresses the conservation of mass. 

The system of equations (30.01) and (30.02) is not in itself complete ; it 
does not allow one to determine the motion of the medium from given initial 
and boundary conditions. Two steps are needed to obtain a complete system 
of equations : first, one has to express the stress tensor in terms of other 
quantities such as deformations, velocities, pressure, temperature and electro- 
magnetic and other fields and, second, if the number of unknown functions is 
greater than four—i.e. greater than the number of equations in (30.01) and 
(30.02)—one must add to these equations some others such as the equation of 
state, the equation of heat flow, field equations, etc. 

In the following we shall confine ourselves to the case of a conservative 
system in the absence of external forces. 

We consider first a compressible elastic medium. Then we can introduce 
II, the potential energy per unit mass of the medium and express the stress 
tensor in terms of it. Let a,, a, and a, be some Lagrange variables, say the 
initial coordinates of a particle. At time ¢ the coordinates of the particle will 
then be 


2 = 24(a1, Ao, Ay, t) (¢ = 1, 2, 3) (30.04) 


+ In this section we use the terms “vector” and “tensor” in their three-dimensional 


meaning. 
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and the deformation of the medium is characterized by the set of quantities 


3 
Ann = > => (30.05) 
i=1 
The potential energy II is a function of the deformation and if one puts 


ai » Pm dAmn  (pmn = Pam) (30.06) 
Pn. n=1 
the components pi of the stress tensor can be expressed in terms of the co- 
efficients P™” as follows 


Ox OLE 
_ Pmna = 30.07 
Pik = > Was oa (pix = Prt) ( ) 


From these equations it is readily deduced that 


dll ama os all 2 Ov; 
es ae —|)= ete 30.08 
P dt o( ot + >a) ae, ( ) 
In the case of a fluid 
Pik = —psix (30.09) 


where p is the pressure, so that the tensor pi, reduces to a scalar. In this case 
equation (30.08) becomes 


ps = ~pdivv = ea (30.10) 


by virtue of (30.02), and this leads to the usual expression 
p == :! _P 
P 


(30.11) 


for the potential energy of a unit mass of fluid. For infinitesimal deformations, 
when the usual theory of elasticity applies, relation (30.08) can equally well be 
verified directly without recourse to Lagrange variables. 

It is interesting to consider the relation (30.08) in conjunction with the 
thermodynamic identity 


du a OU pes 30.12 
a 2, aa, Pp a (30.12) 


in which wu is the internal energy, o the entropy and T the temperature. For 
isothermal processes one can put 


W=u—To=F (30.13) 
where F is the free energy, and for adiabatic processes 
=u (30.14) 


We now turn to the equations of motion. Using the equation of continuity 
we can write the three equations (30.01) in the form 


— 


+ ae (evvk — pix) = 0 (30.15) 
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Just as the equation of continuity 


ae 4S Aen) 
ot a OL, 
expresses the law of mass conservation, so one can take (30.15) to express 
the law of conservation of momentum. In the equation of continuity the scalar 
quantity e is the mass density and the vector with the components pv (i = 1, 


2, 3) is the mass current density. Similarly in (30.15) the vector px is the 
momentum density and the tensor 


=0 (30.02) 


Siz = pve — pie (30.16) 


the momentum current density. So the vector pv; enters once as the mass 
current density and again as momentum density. Similarly, remembering 
that Sy, = Sx, this quantity enters once as the 1-th component of the current 
of pv; and again as the k-th component of the current of py. We have just 
seen that the equation of continuity and the equations of motion can be inter- 
preted as laws of conservation of mass and of momentum. It is natural to try 
to express in an analogous manner the law of conservation of energy. This 
was first done by Umov who introduced the important notion of energy current 
as early as 1874 [4, 5]. 
We introduce a scalar 


1 


and a vector with the components 
1 3 
Si — oo(5o0* + ett) = Pas (30.18) 


In these equations IJ denotes as before the potential energy per unit mass. 
It is obvious that the two terms in S are respectively the densities of kinetic 
and of potential energy, so that S is the volume density of total energy. The 
vector S; which we shall call Umov’s vectort can be interpreted as the vector 
of energy current. For, using the equations of motion, the equation of con- 
tinuity and the relation (30.08), it is easy to verify that 


3 
o 4 > Cela (30.19) 


This can obviously be interpreted as the conservation law for energy. 

One should note that if the equations of motion (30.15) are given, then of 
the two equations: (30.02), the continuity equation, and (30.19), the equation 
describing energy flow, only one is independent. Hither can be obtained from 
the other by using the equations of motion. Instead of either of them one could 
also take any linear combination of them. This would bring in a quantity 

+ The quantities S and S, actually discussed by Umov differed from those introduced here 
by the absence of potential energy terms. Therefore, the right-hand side of the relation 
given by Umov, which corresponds to (30.19), was not zero but the negative of (30.08), i.e. 
the work done by the elastic forces in unit volume and unit time. 


\ 
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o + AS and the corresponding current pv + AS; (with a constant A having the 
dimensions of (velocity)-?). In the following section we shall see that the 
relativistic form of the equations of motion for a continuous medium cor- 
responds approximately to choosing such a linear combination with A = 1/c’. 


In conclusion, we point out the followimg circumstance. In the foregoing dis- 
cussion the flux of a certain quantity S was determined (by means of the 
appropriate conservation law) from its divergence which was assumed given. 
This does not determine the flux uniquely. According to its physical meaning, 
however, the flux vector is.a completely determined quantity. Therefore we 
must expect that if the requirements of the theory of relativity are met, it 
should be formally possible to make the definition of the flux unique, provided 
certain additional conditions are imposed on it. One of these conditions is 
the requirement that the flux vector of the function of state S, should itself be 
a function of state. This requirement will be formulated more precisely and 
discussed in detail in the two following sections. In the examples here discussed 
the requirement is satisfied by the energy flux (Umov’s vector), the mass flux 
and the momentum flux. 


31. The Mass Tensor 

In the previous section we saw that for a conservative system the usual 
non-relativistic equations of motion of a continuous medium can be stated as 
four equations in each of which a sum of derivatives with respect to the spatial 
coordinates and to the time is equated to zero. This form of equation leads 
us to surmise that in Relativity they possess a generalization of the form 


aT 3. gTok 
OL rast OLk 
_ aT 2. aTtk 


pera | 


(31.01) 


where T#*(i, k = 0, 1, 2, 3) is some tensor. 

The first of these. equations should express conservation of mass or energy 
and the other three, for 7 = 1, 2 and 3, the conservation of momentum. If 
T® is the density of total mass, including the rest-mass and the mass of the 
kinetic energy, then cT™ is the mass current density. Also, if cT?° is the density 
of the 1-th component of momentum, then c?7** (k = 1, 2, 3) will be its current 
density. The mass M contained in some volume and the energy, W corres- 
ponding to it will be expressible in terms of 7 as the volume integral 


M= ud = [rear (31.02) 
Cc 


Similarly the momentum contained in the same volume will be 
Pi=c | T° dV (31.03) 


The equations (31.01) express the fact that any increase of the energy or the 
momentum contained in a volume is due to an influx of energy or momentum 
from outside the volume, i.e. through the closed surface enclosing the volume in 
question. The integrals (31.02) and (31.03) taken over all space are the total 
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mass and the total momentum of the system. They remain constant in time. 
In addition to the laws of conservation of mass, energy and momentum, 

conservation laws must also hold for the angular momentum and for the mass 
centre motion of the system. They can be written in a form analogous to 
(31.01). It follows directly from that set of equations that 

2 (gree en r,pT*) i =” (a Tem —-f Tim) =— [ki _ [tk 

825 LS am oa Aa 

(i, k = 0, 1, 2, 3) 

and these relations have the form of the required additional conservation laws 
if their right-hand sides vanish. Hence we get the condition 


Th = Tk (31.05) 
i.e. the tensor T** must be symmetric. This means in particular that the mass 
current must equal the momentum not only in the equations of mechanics, 


but quite generally. We shall now state the relations (31.04) for a symmetric 
tensor separately for k = 1, 2, 3 and for k —0: 


(31.04) 


a 3B ; 
ko __ 40 km __ it — 
3g (i pT) + - Bq Thm — mT) = 0 (31.06) 
é 00 a!) : é i 
and Hitt — apTi?) + a Fg eT” — oTm) = 0 (31.97) 


Here i, k = 1, 2, 3. The equations (31.06) can be considered to be the laws of 
angular momentum conservation and the equations (31.07) as the laws of 
motion of the mass-centre. In both cases these are the laws in differential 
form ; integration of the equations over some volume gives the corresponding 
integral relations. 

If the integrals 


Mé =e { (aT! — 2,7) dV (31.08) 
and Kt=+ yo — | (aT — xT) dV (31.09) 
c 


are taken over all space they will remain constant ; the quantities M2, M*}, 
and M}? are the components of angular momentum of the system and the 
quantities 11°, M?° and M®° divided by c can be interpreted as the products 
of the mass of the system and the initial coordinates of its mass centre. 

The tensor we are discussing will be called the mass tensor.+ Its invariant 
has the dimensions of a mass density. When multiplied by c? the tensor Té* 
is also called the energy tensor, its invariant then has the dimensions of energy 
density. 

We shall impose on the mass tensor another condition which can be stated 
in the form of a physical principle, as follows : The mass tensor must be a function 
of the state of the system. We must define what we mean by state of a system. 

Let us assume that the equations of motion and field equations are written 


+ We prefer this name to the often used “matter-tensor”. The notion of “ matter” is of a 
very general character and should not be identified with the notion of mass. 
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as a system of n first order equations for the unknown functions 9), ..., Qn 
and that these equations can be solved for the time derivatives of the functions. 
If initial values are prescribed for the functions 9,,..., @n this, together 
with boundary or similar conditions, determines their values at any subsequent 
time. This is the principle of causality. We shall say that the functions 
is ssey On characterize the state of the system. Any function of 9,,..., 9n 
which does not contain their time derivatives and also does not contain the 
coordinates explicitly will be called a function of the state. (We may note 
that in the Theory of Relativity derivatives with respect to spatial coordinates 
and to time enter symmetrically and therefore if some scalar, vector or tensor 
does not contain any time derivatives it will also not contain any space deri- 
vatives.) 

Let us give some examples. In the mechanics of mass points a state is 
characterized by the coordinates and velocities of the particles, therefore any 
function of coordinates and velocities is a function of the state. In hydro- 
dynamics a state is characterized by the three components of velocity, the 
density and the pressure, with the two last quantities related by some equation. 
In the theory of the electromagnetic field a state is characterized by the anti- 
symmetric field tensor. 

Thus our physical principlet requires that the components of the mass 


tensor should be functions only of the quantities that characterize the system. 
Derivatives of these quantities must not enter nor should the coordinates 
appear explicitly. (We assume of course that cartesian coordinates are used.) 

We now inquire to what extent the mass tensor is determined by the con- 
ditions we have stated. We take first the equation expressing the fact that the 
divergence of the mass tensor vanishes 


2. [ik 
=0 (31.10) 


Ox 
=o: 


together with symmetry condition T’* = T*!, In addition we make the con- 
dition that the ten integrals of these equations, i.e. W, Pé and M‘* should have 
fixed values. 

Let A‘™.2k be a tensor of fourth rank having the following symmetry pro- 
perties : antisymmetry with respect to the first pair of indices 


Aim, nk — +. qmi, nk (31.11) 
antisymmetry with respect to the second pair 
Aim, nk — __ dim, kn (31.12) 
and the cyclic symmetry 
Aim, nk 41 Ain, km 1 Aik, mn — () (31.13) 
From these properties it follows that , 
Aim, nk — Ank, im (31.14) 


In terms of second derivatives of Aim. we can construct the tensor 


{It seems that this principle has never been explicitly formulated. However, it is 
satisfied by all forms of the mass tensor in common use. 
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3 , 
1 o2Aim, nk 
: B= Y= (31.15) 


OL, Ox 
mn, W=-0 me re 


which we shall call-Krutkov’s tensor.t It is easy to see that it is symmetric 
and that the antisymmetry.of A’™.™* in » and k results in the identity 
2. aBik 
a (31.16) 
k=0 
In addition, Krutkov’s tensor has the following property. If the tensor Té* 
is replaced by Bé* in the volume integrals of the form of M, Pi, M‘* and Kt 
(equations (31.02), (31.03), (31.08) and (31.09)) these all reduce to surface 
integrals. The integrals formed with B‘* will thus all vanish if the tensor 
Aim,nk and its first derivatives become zero on the surface enclosing the volume, 
or, when the volume considered is all space, if they tend to zero sufficiently 
fast at infinity. 
Let 7** be a given mass tensor. Adding to it a Krutkov tensor we get a new 
tensor e 
Tik = Tik 4 Bik (31.17) 
which has the following properties. Firstly, it will be symmetric, secondly, 
by (31.10) ‘and (31.16) it will satisfy the equation 


Laffit 

— t 
> =0 (31.18) 
£20 CLK 


* * * * * 
Thirdly, the integrals M, P¢, M‘* and Kt formed from Té* will be cqual to 
similar integrals formed with the given tensor T**. 
Thus, apart from the condition that the mass tensor should be a function 


of the state, the tensor Hk obtained by (31.17) from T** will satisfy all require- 
ments. In other words, the mass tensor would not be determined uniquely 
if no further condition were imposed. 

However, the condition that the mass tensor should be a function of the 
state completely changes the situation and ensures that the definition of the mass 
tensor is unique in principle. The following circumstance can be considered an 
indication of this. Let us assume that the tensor 7?* is a function of the state, 
so that it satisfies all the conditions imposed, including the requirement just 
discussed. Even if the fourth rank tensor A‘™, "* is also a function of state, its 
second derivatives will not possess the same property and addition of the 
quantities BY to T** is not permissible. It is true that these considerations 
cannot be considered a proof, but a full proof of our statement can be con- 
structed by other means. We shall discuss this in the following section. 

The uniqueness of the mass tensor becomes particularly important in the 
theory of gravitation because the equations of this theory contain the mass 
tensor itself, not only its divergence. 


+ In three dimensions a fourth rank tensor A?™.”* with the properties given reduces to a 
symmetric tensor of second rank with the components y,, = A,%8, Vig SA, Ys3 = 
Al2,12, 4, == A51,12, 4, = A128, yy. = A23,31, The tensor pq Was introduced and widely 


used by G. Krutkov in his book [10]. 
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When expressing equations of motion as the vanishing of four-dimensional 
divergences we imply that the physical system tmder consideration is con- 
servative. A general remark on non-conservative systems should be made. 

As the law of energy conservation is a universal one, a non-conservative 
system should be regarded as one in -vhich some forms of energy, such as heat, 
have not been explicitly taken into account. If there is some form of energy 
partaking in the general balance but not included in the tensor’ 7** the diver- 
gence of this tensor will not vanish; the equations (31.01) will then have on 
their right-hand sides the flux into unit volume of this form of energy, and of 
the corresponding momentum. In the following we shall deal only with conser- 
vative systems. 


31*. A System of Equations for the Components of the Mass Tensor 
as Functions of the Field + 
Let us assume that the state of a physical system is described by the set of 


functions ©, @.,.--, Qn. This set of functions may be called a field and the 
equations of motion relating these functions may be called field equations. 
We have already given examples of physical fields (the fields of velocity, density 
and pressure, the electromagnetic field, etc.). 

The statement made in the previous section concerning the mass tensor can 
be made more precise in the following manner. The symmetric tensor T**, the 
components of which are functions of state (i.e. functions of the field), and whose 
divergence vanishes as a consequence of the field equations, is determined by 
these equations (which are purely local) apart from two constants. The tensor 
Tt* will be defined apart from a constant multiplying factor, provided local 
conditions are supplemented by a condition at infinity. This latter usually 
amounts to the requirement that the mass tensor should become zero where the 
field is zero (i.e. where all or some of the functions ~; become zero). The con- 
stant factor evidently depends on the choice of energy or mass units, and its 
value is determined if these units are fixed. 

Thus finally the mass tensor is uniquely determined for every physical 
system. 

The proof of this statement can be given in each individual case by considering 
two systems of differential equations satisfied by the components of T#* con- 
sidered as functions of 9), po, ..., Qn. The first of these systems expresses the 
fact that 7%* is a tensor and the second the fact that the divergence of this 
tensor vanishes. 

To derive the first system of equations, let us examine an infinitesimal 
Lorentz transformation. 


Equation (23.06) shows that in such a transformation the components of a 
vector receive the increment 


3 
dAt —= > epwtk Ak (31*.01) 
k=0 
+ This section and Section 49* have been added to the second Russian edition. To pre- 
serve the numbering of the original edition, equations appearing in these two sections are 
marked with asterisks. In the first English edition part of the present Section 31* was 
included at the end of Section 31. 
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The corresponding relation for the increment of tensor components is 


3 3 
ST — > egutmTmk + S emeokmTim (31*.02) 


m=0 m=0 


Since the functions of state @s are also subject to a definite transformation law 
(they are always expressed in terms of some scalars, vectors and tensors), they 
will also obey relations of the form 


13 
d95 = 5 > e1€ ment (31*.03) 
1, m=0 
or 
12 
305 =e wimp (31*.04) 
where 
ym = ym (31*.05) 
are some known functions of 91, Q2, .-., Qn. 


We consider certain general properties of the coefficients W{™. 1t can be 
shown that they satisfy the relation 


OYE OYE 
(ves Ta e = € pS mp4 a C48 mg?! + epdipyd™ + egdigh?? 


Op OPr 
(31*.06) 


To prove this we note first of all that this relation is functionally invariant in 


the following sense. If in placc of the functions 9, o2, ..., on we introduce 
* * * 
new functions of state 91, 93, ..-.-, Qn (each set being ex presible in terms of 


the other) and if we also introduce corresponding coefficients yin related to the 


oe by 
Z 
= > yn (31*.07) 


=1 


then equation (31*.06) will be satisfied for the starred quantities. On the other 
hand relation (31*.06) is easily verified directly if the @, form a scalar, a vector 
or a tensor (or a set of scalars, vectors or tensors). Since ultimately all functions 
of state must be expressed in terms of scalars, vectors and tensors it follows 
from the functional invariance of (31*.06) that this relation is generally valid. 
We shall not go into the detailed exposition of the manipulations involved in 
this proof. 

The foregoing equations can be written in a more elegant form by introducing 
the operators X’™ defined by the equations 


n 
sim ( 31*.08 
x = ee = ( ) 
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Then we have, in particular 

Xm (95) = h™. (31*.09) 
Using (31*.06) it is easy to prove the operator equation 


XvaXtm — XimX va = eyBmpX'4 + eqdngX?! + epBipXI™ + ead1aX™? (31*.10) 
If this equation is applied to a function gs, relation (31*.06) follows conversely 
from it. 


The change of any field function (i.e. any function of the 9,, @,..-, Qn) in 
an infinitesimal Lorentz transformation can be expressed in terms of the 
operators X!m, Indeed, if f= f (9, ..., @n) it follows from (31*.04) that 

i 2 
i > O1mX'™ (f) (31*.11) 
2 m=0 


Putting f = ¢s and using (31*.09) we return to equation (31*.04). The formula 
(31*.11) is convenient because of the fact that it is independent of the choice 
of functions @,, ..., Qn. 


We apply equation (31*.11) to one of the components of T**. We have 
3 


1 ; 
87 = 5 > CoimX'm Tik) (31*.12) 
i, m=0 
On the other hand, since T** is a tensor, equation (31*.02) holds; it can be 


written in the form 3 


STi = > wrmlediuT™ + e,3x1Tim) (31*.13) 
i, m=0 
Both expressions for 7#* must be equal identically, i.e. for any choice of anti- 
symmetric quantities wim. Equating the antisymmetric part of the coefficients 
of the w;m in the two expressions we obtain 
Xin Ttk) = eda lk — eSimT + epdy.Ti™ — ep8emTt (31*.14) 


or, in more detail 
Dn = edal™™ — edimT™ + e3uT™ — e3pmT% (31*.15) 


This is the first of the two aforementioned systems of differential equations for 
the 7? as functions of the field. 

The second set of equations is obtained from the condition that the diver- 
gence of the mass tensor vanishes. It follows from this requirement that 


3 n : 
—* oTtk CDs 
a 0 * 
>. > Ops Orr oT 16) 


and this equation must hold for all values of the 9s and of the 05/8, which 
are connected by the equations of motion (and possibly by other relations, 
if not all these quantities are independent). When the 9; and the dps5/ zz are 


expressed in terms of independent quantities equation (31.27) must reduce 
to an identity in these latter variables. Since the 7* contain only the functions 


ps themselves, but not their derivatives with respect to the «, this identity 
leads to a set of relations for the dT/dgs5 which involves only the g, not the 
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0¢r[0%m. These relations are just the second system of equations for the mass 
tensor. We note that both systems are satisfied by the values 

Ttk — emtK (31*.17) 
where A 1s @ constant. 

The complete system of equations for 7'%* consists of these two sub-systems: 
equations (31*.15), which express the fact that the 7** form a tensor and further 
equations which state that the divergence of this tensor vanishes; the latter 
are obtained from (31*.16) by elimmating the derivatives d05/0x; with the aid 
of the field equations. The complete system can be discussed in a simple form 
in particular cases that correspond to given physical systems. When this is 
done, the uniqueness of the mass tensor is confirmed every time. Examples of 
uniqueness proofs are given in Appendices B and C. Further examples can be 
found in the literature [11]. 

To conclude this section we shall show that if the mass density 7° is known 
as a function of the field 9), ..., @x, equations (31*.14) or (31*.15) also permit 
the unique determination of all other componcnts of the mass tensor. Indeed, 
if we assume that 7° is known and if in (31*.14) we put i =hk=1=0; 


m = 1, 2,3 we get qom — Tmo — 1 Xom Too) (31*.18) 


where the operator X°™ has the value given by (31*.08). Thus, the mass flux 
density has been expressed in terms of the mass density. Further, putting 
4=1=0;k, w= 1, 2, 3 we get 


Tmk — XomTor) _ §,,.T00 (31*.19) 
or, using (31*.18), 
[mk — (LX0MXok _ 3,44) T (31*.20) 


This expression is in fact symmetric in m and k. Indeed, we have the operator 
relation 
Xomxok _ Xok yom _ Xmk = (m, k = 1,2, 3) (31*.21) 


and from (31*.14) it follows that the result of applying this operator to 7°? is 
zero (this last fact means that 7 is a three-dimensiona! scalar). 

Thus in order to obtain all components of the mass tensor it is sufficient to 
know the component 7°°. 


32. Examples of the Mass Tensor 

In this section and the following we shall discuss the explicit form of the mass 
tensor in some concrete cases, without dwelling on the proof of its uniqueness. 
Examples of such proofs will be given in Appendices B and Co 

We begin with the simplest case of “ dustlike” matter, by which we mean 
an assembly of non-interacting particles which nevertheless have a con- 
tinuous variation of velocity so that a velocity field can be defined. In this 
case, we introduce the special notation ©** for the mass tensor. We denote by 
o* the invariant mass density, i.e. the density in that frame of reference relative 
to which the particles of the particular volume element in question are moment- 
arily at rest. This frame might also be described as travelling with the particles 
of this volume. Let ué be components of the four-dimensional velocity of the 


particles. We put Dac 
(tk = —o*uiuk (32.01) 
Cc 
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By definition ©* is a four-dimensional contravariant tensor. Its zero-zero 


component. is 


@oo _ eT ee (32.02) 
= i = Tile : 


This component must be equal to the density of total mass, including the 
mass of the kinetic energy. We shall show that this is so : if p* is the density 
of rest-mass in the frame moving with the particles, then the density of rest- 
mass in the “laboratory frame” relative to which a particle moves with 
velocity v, is 

* 


= p 
PS V1 — ve?) 
Further, if p is the rest-mass density, the density of the total mass, including 
kinetic energy, is 
ic sg = 
/(1 — o/c?) 1 — vc? 


This expression for density corresponds to the usual expression 


= @% (32.04) 


nm 


M = ——__— 32.05 

Va — oF vee 

for the mass of a single particle. The other components of the mass tensor 
are, in three-dimensional notation 


: 1 o* ev; 1 Pvt 
(ot = —— 32.06 
el—ve? e4/(1 — v?/c?) vee 
1 ot UE 1 OVW 
and Git = ee ee SI ee 2. 
ce? 1 —v%e2 2 4/(1 — v4Jc?) ee) 
Let us form the divergence of the tensor @¢#. We have 
ee ae A(p*uk) — * 2 aut 
es pee at . a ha 
Zz rane ce - > Or, a he » . Ory, oe 
k=0 k=0 
We introduce special symbols for the sums in (32.08), putting 
* S Aptut) 2 
Q > a (32.09) 
3 at 
and wt == Ss qe (32.10) 
pay. Oe 
These can also be written as 
co 
Q* == Ss r div (pv) (32.11) 
1 cut : Cut 1 dut 
and of SS ve —— | = ——_ — 2.12 
/(1 — v®Je2) ( a >, F = (1 — v%Je2) de eet) 


where d/dt is the substantial derivative. This shows that the invariant Q* 
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is the rate of increase of rest mass in unit fluid volumet and the vector w? is 
the four-dimensional acceleration, the space components of which become the 
ordinary acceleration in non-relativistic approximation. Thus 


“. aQ@ik = Q* 


2 


*K 
ut + 2 (32.13) 
— Or, c c? 


By the equation of motion this expression must be zero. But we have the 
identities 


3 


> uu = c?; >) uw = 0 (32.14) 
i=o i=0 
so that the equation 
Q*ul + pF wt = (32.15) 
is equivalent to the two separate ones 
Q*=0 and wi= (32.16) 


The first of these is the equation of continuity which expresses the constancy 
of the rest-mass of the particles. In the case we are considering the rest-mass 
does not change because the particles do not interact and their internal energy 
remains constant. The second equation of (32.16) expresses the constancy of 
the four-dimensional, and hence also of the three-dimensional, velocity. That 
non-interacting particles move with constant velocity is, of course, physically 
obvious. As the equations of motion are of first order in p* and zu? and as O%* 
is expressed as a function of these quantities this mass tensor is clearly a function 
of the state. Therefore the form of the mass tensor for dustlike matter may be 
considered established. 

Let us now seek to generalize the equations of perfect fluid hydrodynamics. 
In non-relativistic theory this case is characterized by the stress tensor 
reducing to a scalar. This readily permits a relativistic generalization. Let 
us assume that the energy tensor, i.e. the mass tensor times c*, has the form 


7e a ) l- td 
at Nae (u* : Z )utut — perdix (32.17) 
where u* and 7: are four-dimensional scalars connected by the relation 


rie (32.18) 


In the frame, relative to which the velocity at the point and at the moment 
in question is zero, the component 7° of the mass tensor is equal to u*. There- 
fore w* is the rest-mass density of the fluid. As the fluid is assumed elastic 
and can possess compressional potential energy, this rest-mass includes the 
mass corresponding to that energy. Since the compressional energy may 
change, the-rest-mass of a volume of fluid. will not stay constant. With this 
act in mind we introduced a special symbol w* for this rest-mass density leaving 
the symbol p* to refer to that part of the rest-mass density which does not 
change in the course of the motion. 


+ By “ fluid volume ” we mean the volume of an element of fluid always consisting of the 


same particles. 
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We now state the equations of motion. Putting for brevity 


3 
é P 
= a * 1S )yk 32.19 
te Dea, ( #3) Sie 
k=0 
and using the notation w* for acceleration as before, we have 

= aTik foseoeoP op 
Oa = Qui 4- (u* + a — ej — (32.20) 

2 c? 0x4 


In the absence of external forces this expression will be equal to zero. 
Using equation (32.14) we can get from (32.20) another expression for Q, 
namely 


ese if 4 1 dp 
=x oo Er 32.21 
oF ae dry 0? /(1 — vc?) dtc? d= ( ) 


Here dp/dt is the substantial derivative of p and dz the differential of the 
particle proper time. As for 7: itself, a comparison of these equations of motion 
with the non-relativistic ones shows readily that it is the pressure. Equating 
the two expressions for Q we obtain 


oe ie ag 39.99 
> (e+ *) im St are) 


As we assume u* to be a function of p we can introduce a new quantity 
e* by the differential relation 
do* du* 
= = = 32.23 
p> ™ + pie? wee 
We can choose the constant of integration in such a way, say, as to make 
e* =u* when p= 0. Using (32.23) we can write (32.22) as 


og 
>, Bay (o*ut) = 0 (32.24) 


Hence we see that p* can be interpreted as the invariant density of that part 
of the rest-mass which does not change in the motion. This quantity, like 
u*, is a function of p. 

Let us put 


Tl 
ee e*( ee =) (32.25) 
c 
Then we get from the differential relation between u* and p* that 
1 Padp* 
dil = — (32.26) 


p 
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whence 


| The quantity I can be interpreted in analogy with (30.11) as the potential 
energy of a unit mass of fluid, with mass meaning that part of the rest-mass 
which does not vary during the motion. Expressing w* in terms of p* and IT 
we can write the energy tensor as 


1 
etTtk — le + = (o* fT + »)| uluk — pexdix (32.28) 


while the equations of motion appear as 
1 op ldp 
* 4. _ (5*[] wi =e,— — —— ui 32.29 
[ett set +n] wae 2 (32.29) 
The equation for the zero component of acceleration can be replaced by the 
equation of continuity in the form (32.24). 
One may note that the invariant of the mass tensor is 
3 
\ 3p 3p 
> ett = e(t rs 5 mel ea eee (32.30) 
i=0 
Comparing the equations of motion (32.29) with equation (32.28) for Té# one 
sees that the mass tensor J?* is a function of the state, as it should be. 
We now write down the components of the mass tensor in non-relativistic 
approximation, but retaining in T° and 7 terms of an order of 1/c? compared. 


With the main terms. In the main terms we put 


oe 
*=p—-9-— 32.31 
p= 5 Os ( ) 
in agreement with (32.03). Here o is the usual density that satisfies the equation 
of continuity (30.02). We then have 


1 fi 


i 1 
To — : ov, + 5 (5 ov? + oll + ») (32.32) 


1 
Lik = 5 (vive + pdx) 


Comparing the correction terms in T% and T% with Umov’s scalar and vector 
(30.17) and (30.18) we see that the components T° and T% of the mass tensor 
can be written as 

1 


] 1 
S; T% = py +5 S; (32.33) 
c* c c 


TO =o + 


Thus Umov’s scalar and vector give the second order relativistic corrections 
to the usual expressions for mass density and momentum density. As for the 
spatial components of Té, they are proportional to the _three-dimensional 
tensor of momentum current density which was considered in Section 30. 

Using these results we can also write down approximate expressions for the 
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mass tensor for the case of an elastic body. We have 


1/1 
00 __ pe Il 
T e+ (50° +l!) 
poi — } v male : oe + oll} — MUKVE (32.34) 
ree ) i 3° P F 2 
=1 


] 
Tk — a (evve — pix) 


If here we put pix = —pdz as in (30.09), we get back to the case of a perfect 
fluid. 

We shall not discuss the relativistic formulation of the theory of elasticity. 
We shall only make one remark concerning the possibility of using the concept 
of an absolutely rigid body in Relativity. In non-relativistic mechanics this 
concept is introduced as an abstraction, according to which the shape and 
size of such a body does not change, whatever the forces acting on it. In 
particular, a blow imparted at some instant to one end of an absolutely rigid 
body immediately sets the other end of the body in motion. In fact, in a 
physical solid body the blow produces a wave in the body that spreads with the 
speed of sound. Therefore the abstraction implies the assumption that the 
speed of sound may be treated as infinitely great.t However, it is evident 
that if the speed of sound is assumed infinitely great, the speed of light, which 
is actually some hundred thousand times greater, must also be taken as infinite. 
It is thus clear that the abstraction involved in defining an absolutely ngid 
body is permissible only in a non-relativistic theory. As the Theory of Relativity 
is based on the fact that any kind of action propagates with finite speed, this 
abstraction must inevitably lead to contradictions. Therefore the notion of 
an absolutely rigid body may not be used in Relativity Theory. 

However, this does not preclude the use of the notion of a rigid measuring 
rod in discussions of Relativity. For this notion merely presupposes the existence 
of rigid bodies whose shape and size remain unchanged under certain particular 
external conditions such as the absence of accelerations or impulses, constancy 
of temperature, etc. Such rigid rods can be realized with sufficient accuracy 
by solid bodies existing in nature and they can serve as standards of length. 
As we showed in Section 2 the constancy of their length can be checked, say 
by comparison with the wave length of some spectral line. 


33. The Energy Tensor of the Electromagnetic Field 

In the preceding sections we discussed the mass tensor, and the energy 
tensor which is proportional to it, for a substance consisting of particles inter- 
acting by means of elastic forces. We now consider the energy tensor for a 
substance of particles interacting only by means of the electromagnetic field. 
As we are dealing with macroscopic theory we can imagine the substance to 
be a continuous medium with a continuous charge distribution. 

We start from Maxwell’s equations in the form given by Lorentz. We put 
for the components of the electric and of the magnetic field 


t One reaches the same conclusion if one considers the absolutely rigid body as the limiting 
case of an elastic body in which the elastic moduli are infinite. 
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Ey = Fy; Ey = Faq; E, = Fy, 


H, = F.;; |; ee re yee ee 20H) 


As we saw in Section 24, the four-dimensional statement of Maxwell’s equations 
1s : 


OP ix OF x OF y 


PS a 
ikl Oa, ae on (33.02) 
F 3. gFtk 
an —— = st (33.03) 
kao OUR 
where the Ptk — ee. Fy, (33.04) 


are the contravariant components of the antisymmetric field tensor and where 
the four-dimensional vector sé has the components 


4 
S=4r; st=—f—=—py (6 =1,2,3) (33.05) 
Cc 


Here p denotes the charge density, v‘ the three-dimensional velocity and 
js = py the electric current density. Introducing the invariant charge density 
e* and the four-dimensional velocity u‘, we may replace (33.05) by 


4 
st= —p*ut = (iG = 0, 1,2, 3) (33.06) 
c 


We saw in Section 25 that the equations of motion of a particle of charge e 
and rest-mass m is of the form 


3 
e 
Bt ears Fupuk 33.07 
mw; v2 Ku ( ) 


Here we have on the right the Lorentz force acting on the charge e. To 
pass from a single particle to a substance with continuous charge and mass 
distribution we must introduce the invariant charge density p* in place of the 
charge and the invariant mass density 2.* in place of the mass. We then get 


3 
e* 

if) oer NS k 33.08 

u* wy; ; > Pixu ( ) 
k=0 
1 3 

dyes 2 Se Sy gk 33.09) 

or [Ln U4 Ae L is ( 


As we have introduced two new functions, p* and p* which are not necessarily 
proportional, we must adjoin to the equations of mction twe further equations 
for these functions. The equation for p* is already implied by Maxwell’s 
equations and is an expression of the law of charge conservation 


3 

F,) KK 

> EE) di (33.10) 
OxLk 

k=0 


The equation for y* must be stated separately. We assume that the rest- 
mass of the particles does not change in the motion ; (this assumption implies 
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that no Joule heat is developed). Then we have 
3 


gay HY Wo (33.11) 


Ox 
k=0 BE 


Of the equations of motion (33.09) only three are independent, because we 
have the identity 


3 * 3 
ae > ulwy = a > Fyutuk = 0 (33.12) 
i=0 t,k=-0 


Thus the equations of motion (33.09) together with the equation of rest-mass 
conservation represent four independent equations. Our problem is to write 
them in the form of the divergence of some tensor. To do this we first use 
equation (32.13) according to which 

3 3 
u*wi + Q*ul = c? oot 
Ox, 


k=0 


(33.13) 


1 
where O% = — p*utuk (33.14) 
c 


(Note that now the invariant rest-mass is denoted by u* and not by p* as in 
(32.01)). On the other hand the equations (33.09) and (33.11) can be combined 
into the four independent ones 


1 3 
*wy + OF uj = > Fyysk (33.15) 
An er 


which are equivalent to (33.09) and (33.11), as a consequence of (33.12). The 
left-hand sides of the last set of equations, or rather their contravariant forms, 
have already been expressed as the divergence of a tensor. It remains there- 
fore to do the same with the right-hand sides. 

For this purpose let us consider the tensor 


1x 1 . 
Vin = -- re 2 Cm imP km + 6x eas D, FoF ns (33.16) 


and denote by f; the negative of its divergence. We have then 


3 
= Wiz 
fi=- Ze e (33.17) 
Performing the differentiations, some simple calculation gives 
1 oFH 1S 
fee pene, Pipes oe) os PKLR 
i re ar » FYB yy (33.18) 
k,t=o k,l=0 


where the Fi: are the quantities defined in (33.02). By virtue of Maxwell’s 
equations (33.02) and (33.03) we find 


] 3 
fi=- > Fixs* (33.19) 
4n > 


which is the Lorentz force as it appeared on the right-hand side of (33.15). 
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Thus both members of (33.15) have been represented in the form of divergences. 
Passing to contravariant components we can write the equations 


p*wt + Fut — fi = 0 (33.20) 
2. Lik 
as <i 
ia = 9 (33.21) 
k=0 
where ckTik — or? Qtk + [ik (33.22) 


is the energy tensor of the whole system, comprising the material medium and 
the field. In the last equation the first term on the right is the energy tensor 
of the inedium and the second that of the field. 

The tensor 7** involves the invariant density u*, the velocity components 
ut and the field components Fz. Since these quantities characterize the state 
of the system, this mass tensor is evidently a function of the state. 

We now examine the field energy tensor in a little more detail. 

In three-dimensional notation the components of the energy tensor of the 
electromagnetic field have the following form 


1 
U% = Ugg = — (Et + HY (33.23) 
TT 
ee | eT OR 33.24 
= —U,= DIE x Hy (38.24) 
; 1 1 
and Uik — Ui == = (iE x + HiHx) + a OK (£2 + H?) (33.25) 
TT 


We note that the invariant of the energy tensor vanishes. 
3 
> equa = 0 (33.26) 
i=0 


As always the component U™ represents the energy density, and the com- 
ponents U% multiplied by c the energy current vector, i.e. Umov’s vector. 
It can be written as 


= = [E x H] (33.27) 


and is usually called the Umov-Poynting vector, or simply Poynting’s vector, 
because Poynting was the first to give the explicit expression for the energy 
current density of the electromagnetic field resulting from Maxwell’s equations. 

Quite generally, the energy current divided by c? is equal to the correspond 
ing mass current and the mass current is in turn equal to the momentum. 
Therefore the Umov-Poynting vector divided by c? is equal to the electro- 
magnetic momentum density. The fact that the electromagnetic field carries 
momentum gives rise to the forces of radiation pressure first discovered experi- 
inentally in 1900 by P. N. Lebedev [13]. Equations (33.25) show that if a 
body is situated in an electromagnetic field a surface element do normal to the 
a;-axis is acted on by a force with the components 


Piet Uade. (eS 1298) (33.28) 
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We consider a totally reflecting body, part of whose boundary surface is 
flat. Let the equation of this flat part be x, = 0, with the body extending into 
the region x, > 0. We assume that a plane wave of the form 

EY = —sindf HH? = sindg 
E% = cosdf HS = —cosdg (33.29) 
BB = 9 Hy =f 


impinges on the body, where 


saa _ . (x, cos } + x, sin »)) 
; (33.30). 
g=a( or (x, cos } + x, sin »)) 


We add to the field (33.29) the reflected wave which is obtained by. replacing 
9 by m — 9 and g by —g in the preceding equations and so get the complete 
field, satisfying all boundary conditions. The values of the field components 
on the surface of the body will be 


BE, = —2sndf H,=0 
E,=0 H, = —2 cosdq (33.31) 
E,=0 H, = 2f 


where the functions f and g are taken at 2, =0. Inserting these values in 
(33.25) we obtain for the force acting on unit surface area of the body 


1 
BH Un =o ges: => Ue 0. Fy =U =0 - (83.32) 


Hence, whatever the direction of the incident wave, the force acts in the 
direction of the inward normal to the surface and thus is a normal pressure 
(see [12]). 

Let us now consider another case. Let the electromagnetic field consist of 
a superposition of waves, uniformly distributed over all directions. Then the 
statistical mean of the products of field components will be 


ae ae 
kik, = 3 BSin; Hi, = , AS, (33.33) 


The space components of the energy tensor will then be’ 
7 des exes, | fs Dez 
UE = ve (BE? + Ai = = Ug dtx (33.34) 
4x 3 


In this case the electromagnetic field produces an isotropic pressure equal in 
magnitude to one-third of the electromagnetic energy density. This result is 
of great importance in the theory of black body radiation. 


34. Mass and Energy 

According to the terminology in use in mechanics the mass of a body is a 
measure of its inertia (inertial mass). On the other hand the word “ mass ”’ is 
also used to mean the ability of a body to produce a gravitational field and to 
experience forces in such a field (gravitational mass). Inertia and the property 


The Theory of Relativity in Tensor Form 11] 


of producing a gravitational field are entirely distinct manifestations of the 
nature of matter. It is, however, no accident that the measures of these two 
distinct manifestations carry the same name. The reason is that both properties 
are always present together and are always proportional to each other so that 
with a suitable choice of units one.and the same number can be used to measure 
them both. The equality of inertial and gravitational mass is an experimental 
fact which has been verified with enormous accuracy by Eétvés, but the defini- 
tions of the two concepts are quite different, corresponding as they do to quite 
different manifestations of the properties of matter. There exists a physical 
theory, namely Einstein’s Theory of Gravitation, in which the fundamental 
law of the equality of inertial and gravitational mass is accounted for auto- 
niatically in the sense that one and the same constant appearing in the solution 
of Einstein’s equations figures as inertial and as gravitational mass. 

How should one answer the question : are inertial and gravitational mass 
the same thing or net? Their manifestations are different but their numcrical 
characteristics are proportional. Such a state of affairs is customarily des- 
cribed by the word “ equivalence ”’. 

An analogous question arises in connection with the concepts of mass and 
energy. (For definiteness we shall discuss the inertial mass.) We have just 
recalled the definition of mass. Energy is usually defined as the ability to do 
work. For its definition the essentials are, first, the law of conservation of energy 
and, second, the ability of different forms of energy to transform into each other. 
Both together are known as the law of conservation and transmutation of 
energy. Because this universal law is valid, a measurement of any kind of 
energy can be reduced to a measurement of one particular kind, e.g. mechanical 
energy, and one and the same unit, e.g.'a mechanical one, can be used to 
measure all kinds of energy. 

Thus the manifest properties of matter corresponding to mass and to energy 
are undoubtedly different. However, the Theory of Relativity asserts that 
mass and energy are indissolubly connected and are proportional. Any change 
of energy of a system is accompanied by a change of inertial mass. This is 
true not only for the kinetic energy of a body in a change in which rest-mass 
remains unaltcred, but also for changes of all kinds of internal energy in which 
the rest-mass also changes. 

Phenomena are known in physics in which the whole of the energy corres- 
ponding to the rest-mass of a body can transmute into radiative energy, which, 
of course, possesses the samc mass. Conversely rest-mass energy may be created 
at the expense of radiative energy. An exaniple of this is the conversion of an 
electron-positron pair into a gamma quantum and the reverse phenomenon 
of the creation of such a pair by a quantun. 

In the preceding sections we have studied in detail the relation between 
mass and energy. We saw that to any energy W one should ascribe a mass 
M = W/c? and to every mass one should ascribe an energy W = Mc?. The 
two quantities are always proportional and if they are both expressed in the 
same units, say those of energy, they may be measured by the same number. 

Further, we saw that the energy tensor differs only by the factor c? from 
the mass tensor and that the law of conservation of energy is at the same time 


the law of conservation of mass. 
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Thus to the question posed above whether mass and energy are the same 
thing or not we can give the same answer as to the question of the relation of 
inertial and gravitational mass. The properties of matter manifest as mass 
and as energy are different but their numerical characteristics are proportional. 
Thus in this case we also can talk of equivalence-the equivalence of mass and 
energy. But it is better to call this fundamental law simply the law of pro- 
portionality of mass and energy. 

We have just stated that the law of energy conservation is at the same 
time the law of mass conservation. A question arises immediately. Experiment 
shows that for an enormous majority of known physical processes the mass of 
a body, determined by weighing, and its energy, determined by work done, are 
conserved separately. One observes two conservation laws. How does this 
relate to the fact that in the Theory of Relativity only one law is formulated ? 

This question can be answered as follows. There is but one rigorous con- 
servation law, which applies to the entire mass of a body M and to the cor- 
responding entire energy W = Mc. But an overwhelmingly large part of the 
energy (and of the corresponding rest-mass) does not usually take part in 
transformations and is conserved separately. It follows then that the remain- 
ing part of the energy, which is active in transformations is also conserved 
separately. 

The division of energy into a “ passive”’ part which does not undergo any 
transformation in a process and an active part which is capable of changing 
into different forms can be followed through in the examples of the hydro- 
dynamical equations discussed in Section 32. There, by use of the frame 
travelling with the fluid particle in question we separated out the kinetic 
energy and then considered two rest-mass densities, the total density p* and 
the density o* of the passive part. The two quantities were connected by (32.35) 
which gives 


1 
us = p* + a o* IT (34.01) 


where the second term on the night is the density of compressional potential 
energy divided by c?. 

The division is still more clearly apparent in the approximate equations 
(32.32) and (32.34) which give the total mass density 7 and the tctal mass 
current density cT%. By (32.33) these equations can be stated as 


1 1 
TO = 5 + 3 S; cT™ — ov; + 3 Si (34.02) 


Here p and ov; are the density and current density of that part of the mass 
which does not take part in any transformation while Umov’s scalar and 
vector represent the density and current density of the active part of the 
energy, or, when divided by c?, of the active part of the mass. 

We spoke above of the fact that the total mass of a body as determined by 
weighing, including the variable part, is practically conserved although the 
body may emit or absorb energy. This is explained simply by insufficient 
accuracy of weighing and by the fact that in ordinary bodies a predominant 
part of the mass is passive. A change in the active part, on the other hand, 
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can be detected with much greater accuracy by measuring the corresponding 
part of the energy, i.e. by calorimetric methods rather than by weighing. 

It is natural to inquire into the deeper reason for the fact that in normal 
conditions the predominant part of the energy is bound so durably as to be ina 
completely passive state. Why does even a negligible part of it never leave 
this state and destroy the separate balance of the active part? The Theory of 
Relativity by itself is unable to answer this question. One should look for the 
answer in the domain of quantal laws, which have as one of their main features 
the .existence of stable states with discrete energy levels. With elementary 
particles the energy corresponding to their rest-mass can only transform into 
the active form of radiation as a whole, or not at all. A small loss of mass is 
impossible. This has been verified expcrimentally in the electron-positron 
case but is also believed to be true for all other elementary particles. As the 
predominant part of the mass of atoms has the form of the mass of elementary 
particles the impossibility of a small change of mass must also hold for atoms. 

Thus the reason for the particular stability with which the passive part 
of the energy is bound is of quantal character. 

It is worth noting that the division of energy, or mass, into an active and 
a passive part is of a relative nature. In ordinary chemical reactions not only 
intranuclear energy but also the energy of the inner electronic shells behaves 
passively. At very high temperatures, when a complete or nearly complete 
ionization of atoms becomes possible, the energy of the inner electron shells 
becomes active. Finally, in processes involving rearrangements within atomic 
nuclei the intranuclear energy becomes active as well. However, even then 
the energy corresponding to the rest-masses of the heavy elementary particles 
that form the nucleus remains in a passive state. 

The particularly durable binding of the predominant part of all energy, or 
mass, is the reason why one can speak of the laws of conservation of mass and 
of energy as two separate laws, although in Relativity the two laws coalesce 
into one. 

The law of mass conservation in chemical reactions was discovered and 
experimentally proved by Lomonosov and then confirmed by Lavoisier. As 
for the law of energy conservation, its precise formulation was only given in 
the nineteenth century by R. Mayer. However, Huygens in the seventeenth 
century and Johann and, especially, Daniel Bernoulli in the eighteenth century 
already used it in mechanics and Lomonosov was aware of its general character, 
as can be seen from his famous letter to Euler written m 1748. 


CHAPTER III 


GENERAL TENSOR ANALYSIS 


35. Permissible Transformations for Space and Time Coordinates 


As the basis of our mathematical formulation of Relativity Theory we 
chose the wave front equation 


cur) [C3) (3+ SY] - 


and the related expression for the square of the interval 
ds® = c*dt? — (da? + dy? + dz?) (35.02) 
((Vw)? is to be understood as an abbreviation for the differential expression 


on the left-hand side of the wave front equation.) If we introduce our usual 
variables 


Lo = ct; 1 =]2 5 ry; te (35.03) 
and also the numbers 
ty ls é, =e, =e, = —1 (35.04) 
the expressions (Vw)? and ds* can be written as 
3 
dw \? 
Vo)? = — 
(Ya)? = > ex 5°] (35.05) 
k=0 
3 
and ds* = ¥ ex(dax)? (35.06) 
k=0 


We know that both these expressions are invariant under Lorentz trans- 
formations. If new coordinates 

Rae: Beek, aye wee. (35.07) 
are introduced which are connected with the previous ones by a Lorentz trans- 
formation, we get 


2 Oo) \* 
(Veo)? = ex <5] (35.08) 
— On, 
k=0 
3 
and ds? = > ex(da,)? (35.09) 
k=0 


Variables such as (35.03) or (35.07) in which (Vw)? and ds? have the forms 
(35.05) and (35.06) or (30.08) and (30.09) will be. called Galilean coordinates, 
this term now being understood to include the time. 

We now assume that while 2}, 2, x, and x, are given by (35.07) as before, 
and so are Galilean coordinates, the quantities x, 7,, v. and x, are no longer 
equal to (35.03) but instead are some auxiliary variables connected to 2), 21, 
x, and x, by relations of the form 


t. = f,(2o, 1; Le; £3) (x = 0, L, 2, 3) (35.10) 
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where the f, are arbitrary functions subject only to some general conditions. 
We shall assume that the equations (35.10) can be solved for Xo, Ly, ZT, and zy 
so that their Jacobian must be non-zero 

D(x, ae %; 3) 

Dias eee) #0 (35.11) 
Further we suppose that the functions f, have continuous derivatives of the 
first three orders. There will be other conditions on the J, which arise from 
physical considerations ; these will be examined later. 

If this change of variables is made (Vw)? becomes a homogeneous quadratic 

form in the first derivatives with respect to the variables Zo, 2%, L_ and xz, 
We write this form as 


3 
Ow Ou 
Veo)? = gre ee 
(Vo)? = > gt = Se (35.12) 
a,R=0 i B 
3 
Cx, Ox 
h ap ba 
where 9 ae ee (35.13) 


3 
ds?= % 9,4 dx, dx, (35.14) 
a, 8=0 
3 , , 
Ox, OX: 
ith = eae pee : 
wi Iu 2 a (35.15) 
It is readily deduced from (35.13) and (35.15) that 
: eho ihc Mk ifv=u 
Zoo = Blo ityah ee 


Hence if g is defined as the determinant 
g = Det gu (35.17) 


the quantities g** will be the first minors of this determinant divided by g itself. 
Using the rule of determinant multiplication we get 


Det (cx =) Det (=) = Det 9,, (35.18) 
OL 5 OL, 
Here the second factor is equal to the Jacobian D of (35.11) and since 
€o€1€,€4 = —1 (35.19) 
the first factor is —D. Consequently 
g= —D? (35.20) 


It is useful to restrict the choice of variables ro, 2, x, and x3 by conditions 
which ensure that 2p, like x}, is of the nature of a time whereas 2,, x, and 2, 
are of the nature of spatial coordinates. These conditions must be formulated 
precisely. As before, we mean by the term “ event” a spatial pomt considered 
at a particular moment in time; it may be called a “ point-instant”. We 
Jemand that two events having the same values of the spatial coordinate 
parameters 2, Z, and z, but different values 2} and 2** for their time para- 
neters shall be in time sequence in the sense of Section 12. We know that for 
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events in time sequence the squared interval is positive. This must be true in 
particular for an infinitesimal interval, so if we take the difference 2>* — ze 
to be infinitesimal and put 


" oF = Gq; xt* = rq + day (35.21) 
we must have ds? = gy,dx2 > 0 (35.22) 
whence Yoo > 9 (35.23) 


Assume further that we have two events with the same time parameter 2) 
but different values of the spatial parameters, namely (x,, £2, Z3) and (z, + 
dx,, L, + dax,, t, +dzx3). We require that two such events shall be quasi- 
simultaneous in the sense of Section 12. For quasi-simultaneous events ds? 
is negative, therefore we must have 


3 
ds? = > Gtk dx; dx, <0 (35.24) 


i, k=1 

whatever the values of dx,, dx, and dx, provided not all three are zero. It 
follows that the quadratic form (35.24) must be negative-definite. It is a well 
known algebraic fact that the necessary and sufficient conditions for this are 
the inequalities 


911 912913 
9219 22923| <9 (35.25) 
931 932943 
911 912 Go2 923 >0; 911913 > 0 (35.26) 
Jar Joe 932933 931933 
Iu <9; G22 <9; Jas < 9 (35.27) 


which, incidentally, are not all independent. An independent set of con- 
ditions is, for instance (35.25), the first inequality of (35.26) and the first of 
(35.27). 

It is not difficult to show that if all these conditions are imposed on the 
coefficients g,, then, regardless of whether they are given by (35.15) or not, 
it is possible to represent ds? in the neighbourhood of any point as the sum of 
four squared terms, one with a positive and the remaining three with negative 
signs. The set of signs of the terms is called the signature of the quadratic 
form. In our case the signature can be written as (€9, €), &, €3) or as (+ — — —). 

It follows from the inequalities (35.25) to (35.27) that the determinant g 
is always negative and also that similar inequalities involving the g®* with 
upper indices hold, consequently we have 


g >0 (35.28) 


3 
and > game <0 (35.29) 
{, k=1 


where @,, @, and w, are any three numbers, not all zero. We shall not give the 
proofs of these purely algebraic statements. 

Thus, in order that the parameter x, should have the character of time and 
the other three, z,,z, and x5, the character of spatial coordinates, it is necessary 
and sufficient that go. should be positive and that the quadratic form with 
the coefficients gex (2, & = 1, 2, 3) should be negative-definite. There is no need 
to impose any restriction on the quantities 919, Joy and gq. 
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Let us now consider: the geometrical meaning of the equations x, = const. 
and x, = const. We shall derive a condition under which the equation 
w(x, y, 2, t) = 0 (35.30) 
can be interpreted as the equation of a surface in moticn. It follows from this 
equation that the differentials of space and time coordinates, are related by 
Oz dx + Oy dy + Wz dz + @: dt = 0 (35.31) 
where wz, @y, Wz and co, denote the derivatives of w with respect to 2, y, 2 


and t. We take a displacement (dz, dy, dz) in the direction of the normal to the 
surface and put 


: Wz Wy Wz 
dz = ————. dn; dy = ———-dn; ae 
= |grad «| re 2 | grad w| Ee | grad «| a 


(35.32) 


so that |dn| is the absolute value of the displacement. Inserting into (35.31) 
we get 


| grad w|dn + widt = 0 (35.33) 
and therefore the square of the displacement velocity 
dn\? 
z2— ff 
v ( ;) (35.34) 
will be given by 
fe Oe 35.35 
(grad w)? oe 


Thus (35.30) can be interpreted as the equation of a surface, each point of 
which moves normally with a speed given by (35.35). However, such an inter- 
pretation is only possible as long as this speed does not exceed that of light. 
According to (35.35) and (35.01) this means that we must have 


(Vw)? < 0 (35.36) 


The equality sign is valid for motion with the speed of light. 
On the other hand, if 


(Voo)2> 0 (35.37) 
equation (35.30) can be solved for the time and written in the form 
1 
t= -f(c, y,2) (35.38) 
C 
with (grad f)?< 1 (35.39) 


Equation (35.38) assigns to every point in space a definite instant of time in 
such a way that all the four-dimensional ‘“ point-instants”’ are quasi-simul- 
taneous. Such an equation may be culled a “time-equation”. We recall 
that time equations occurred in Section 3 in connection with the question 
of the characteristics of Maxwell’s equations. 

As we remarked in Section 3, an equation w = 0 can be considered as the 
equation of a hypersurface in the four-dimensional space-time manifold. Such 
hypersurfaces can then be divided into two classes. 

If (Vw)? <0 we can say that one of the dimensions of the hypersurface is 
time-like (the inaccurate phrase “ the surface is time-like ” is sometimes used). 
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By (35.35) this describes an ordinary two-dimensional surfacef moving with a 
velocity less than that of light. 

If (Vw)? > 0, on the other hand, we say that the hypersurface 1s space-like. 
It then represents the whole of infinite space, the various points of which are 
all taken at different instants of time, the time ¢ at which the point (z, y, z) 
is taken, being determined by the time equation, i.e. the equation of the hyper- 
surface ; the instants of time assigned to any two points in space must be so 
close that the corresponding four-dimensional intcrval is always space-like. 

We use the fact that (Vw)? is an invariant and in turn put w = %, © = %}, 
w =2,andw =2;3. This gives 

(Vara)? = g®> 0 (35.40) 
and (Vay)? =g!<0; (Va)? =922<0; (Vzy)? = 9? <0 (35.41) 
Hence the equation x, = const. is a time equation and the three equations 
ar, = const. (k = 1, 2, 3) represent surfaces moving in the direction of their 
normals with less than light velocity. These latter are thus equations of moving 
spatial coordinate surfaces. 

It follows also from our conditions on the transformations of space and 
time coordinates that constant values of 2, x, and a, correspond, in any 
inertial frame of reference, to motion of a pcint with less than light velocity. 

In classical Newtonian mechanics one often uses a time dependent co- 
ordinate transformation which is interpreted as passing to a moving frame of 
reference. In comparing coordinate transformations in Newtonian mechanics 
with the transformations of space and time coordinates in the Theory of 
Relativity it is essential to realize the following. Firstly, in the general case of 
accelerated motion the very notion of a frame of reference in Newtonian mech- 
anics is not the same as in Relativity. The Newtonian concept involves the 
idea of an absolutely rigid body and the instantaneous propagation of light. 
In Relativity, on the other hand, the notion of a rigid body is used, if at all, 
not in an absolute sense but only for non-accclerated motions and in the absence 
of externa] forces, and is of an auxiliary naturc; the concept of a frame of 
reference is not based on it but on the law of wave front propagation. The 
prototype of a Newtonian frame of reference is a rigid scaffolding, the prototype 
of a Relativistic one is the radar station. Secondly, the class of transformations 
permissible in Newtonian mechanics is much wider than in the Theory of 
Relativity ; Newtonian mechanics does not have to consider the limitations, 
discussed above, which arise from the existence of a limiting speed. 

As an example we consider a transformation which can be interpreted in 
Newtonian mechanics as going over to a uniformly accelerated framc. Let 
a’, y’, z’ and t’ be the space and time coordinates in an inertial frame, i.e. 
Galilean coordinates. We put 


#: 1 , ? 
% =a” —-al?, yy ys C= 2 (35.42) 


; a 
and also (==t— = ta, (35.43) 
oe 


t In the four-dimensional manifold a hypersurface has three dimensions but in the 
present case only two of these are spatial, 
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The variables z, y, z and ¢ can be interpreted as space and time coordinates 
in a certain accelerated frame (in the Newtonian sense and in the corresponding 
approximation). Inserting (35.42) and (35.43) into the expression for ds? we get 


2 
ds® — (c? — 2ax — at?) dt? — dx? — dy? — dz? + = (xdt+tdx)® (35.44) 
c 


The required inequalities for the coefficients will hold if the conditions 


1 art ; az\? at? ; 
ae ae 1-S) -=>0 (35.45) 
are satisfied. In addition we can require that 
2 aia SG 35.46 


These inequalities show that the substitutions (35.42), (35.43) are permissible 
only in a part of space and only for a limited length of time. 

Another example is the transformation corresponding to the introduction of 
a uniformly rotating frame. We put 


f 


a ee 
and obtain 
ds? = [c? — w(x? + y?)] dt? — 2u(y dx — x dy) dt — dx? — dy? — dz? 
(35.48) 


The conditions on the coefficients require 
c? — w(x? + y?) > 0 (35.49) 

which is satisfied only for distances from the axis of rotation less than that 
where the linear velocity of the rotation equals the speed of light. 

We stress once again that the examples given here have physical sense only 
in a region in which Newtonian mechanics is applicable (see also Section 61). 

It is obvious that the introduction of ordinary curvilinear spatial coordinates 
is always an allowed transformation. As long as the transformations do not 
involve time they have the same geometrical meaning as in non-relativistic 
theory. Therefore we refrain from discussing them. 


36. General Tensor Analysis and Generalized Geometry 
In the previous section we considered the expressions 


a Jo) Co) 
i af 36.01 
a= eS au, ry 
a, (s=0 = 
and 
ds? == > Gy, dz, dL, (36.02) 
a, (--0 


which were obtained from the usual expressions of Relativity Theory by 
introducing variables 7, 72, 23 and Zp in place of the space and time coordinates 
x, y, z and t. We established the conditions subject to which the variable Zo 
can characterize a sequence of events in time and the variables ,, x, and 2, 
their location in space. 
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By itself, the introduction of new variables can naturally not influence the 
physical consequences of the theory ; it is merely a mathematical device. How- 
ever, the development of a formalism which permits the statement of equations 
of mathematical physics (such as equations of motion and field equations) 
directly in terms of arbitrary variables without the use of cartesian spatial 
coordinates and time, is not only useful as a device for convenient computa- 
tion but is also important in principle. The existence of such a formalism can 
show the way to generalize physical theories. 

We shall call equations generally covariant, if they are valid for any arbitrary 
choice of independent variables. The formalism that allows one to write down 
generally covariant tensor equations will be called “ general tensor analysis. ” 

Generally covariant equations are already used in Newtonian mechanics. 
We refer to Lagrange’s equations (of the second kind) which describe the motion 
of a system of mass points in generalized coordinates and also their generalization 
for continuous media. While they state nothing physically new as compared 
to equations in cartesian coordinates, Lagrange’s equations nevertheless play 
an important part both in practical applications and in theoretical investiga- 
tions. In the Theory of Relativity general tensor analysis has a similar purpose. 

In general tensor analysis the starting point is the pair of equations (36.01) 
and (36.02) giving the square of the four-dimensional gradient and the square 
of the interval. One says that these expressions characterize the metric of space- 
time. The coefficients g** and y,, entering the equations are thought of as 
functions of the variables x, 2, Z and 23. 

We have so far assumed that the expressions (36.01) and (36.02) are derived 
from (35.01) and (35.02), or from (35.08) and (35.09), by introduction of new 
variables, so that the coefficients g*° and g,, can be represented in the forms 
(35.13) and (35.15). In this case the ten coefficients Ju, can be expressed in 
terms of the four functions fo, fj, fg and f, as follows : 


3 
Jus = > Ce = = (36.03) 
k=0 


By virtue of (35.16) the g** can be expressed in terms of the same four functions. 
However, it is important to note that the equations of general tensor anlaysis 
are hardly any more complicated if it is not assumed that the g,, can be rep- 
resented in the forin (36.03) but that instead they are taken simply as given 
functions of the coordinates, i.e. of the variables, zy, 7,, 22 and z3. This more 
general point of view corresponds to the introduction of non-Euclidcan geometry 
and a non-Huclidean metric in space time. Such a step takes one beyond the 
limits of ordinary (so-called “ Special”) Relativity and is connected with the 
formulation of a new physical theory, namely Einstein’s Theory of Gravitation. 
Later chapters of this book are devoted to this theory, but in this chapter, we 
adopt a purely formal point of view and develop general tensor analysis on 
the assumption that the metric is given and the g,, are known functions of the 
coordinates. Such a presentation has two advantages. In the first place we 
can find the conditions which the g,, must satisfy in order to be expressible 
in the form (36.03); this gives us a generally covariant formulation of the 
usual Theory of Relativity. In the second place we obtain in this way the 
mathematical apparatus for formulating Linstein’s Theory of Gravitation. 
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Before going on to a systematic exposition of general tensor analysis we 
establish the connection between the expressions (Vw)? and ds? which exists if 
3 
> Gus” = &, (36.04) 
a=0 


regardless of whether the Due are of the form (36.03) or not. We show that if a 
function w(x», £1, L_, £3) satisfies (Vw)? = 0 then differentials of the coordinates 
related by the condition w = const. satisfy ds? = 0. 


Putting 
a. Guszyho 36.05 
Oe Ba, « = 0, 1, 2, 3) (36.05) 
We write (Vw)? = 0 in the form 
3 
G= F fPoog = 0 (36.06) 
a f=0 


This partial differential equation for w is of the same type as the Haiilton- 
Jacobi equation of classical mechanics and can be solved similarly to the 
latter. If we solve it for w, and write 

®9 = —H(o,, &., Ws) (36.07) 


the function H will correspond to the Hamiltonian and Hamilton’s equations 
will be 


de ae OOF 6 NE (k =1, 2, 3) (36.08) 
dt,  Berx dig OxK 
But 
OH Bq _ (G/@wx) (36.09) 
OOK Gwe — (EG Ae) 


and the first three equations of (36.05) show that the differentials dz, (« = 0, 
1, 2, 3) are proportional to the partial derivatives of G with respect to the ,. 
Denoting the infinitesiinal coefficient of proportionality by dp, we have 


dp 0G 3 : 
ae | By), 36.10 
lige ap > 9 Wy ( ) 


Solving for w, with the use of (36.04) we get 


wo, dp = S Yas di, (36.11) 
and the obvious relation — 
an dz, = 0 (36.12) 
then gives ics 
ds? = S Jay (La, tg = 9 (36.13) 
a =O 


as required. Thus if we continue to consider the equation (Vw)? = 0 as des- 
cribing a wave front we can take it that for points on the wave front the differ- 
entials of space and time coordinates are related by ds* = 0. . 

In the following we shall consider the g,, as given functions of the variables 
Lo Z, L, and x, and shall inerely assume that they have continuous derivatives 
of all orders considered and that they satisfy the inequalities stated in Section 
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35. In addition to the g,, we shall consider the g*°, their connection being 
(36.04). The conditions under which the g,, can be represented in the form 
(36.03) will be established in Section 42 below. 


37. The Definitions of a Vector and of a Tensor. Tensor Algebra 

In tensor analysis one frequently has to deal with sums similar to those in 
(36.01) and (36.02), in which the summation index appears twice. Following a 
suggestion by Einstein we introduce an abbreviated notation for such sums 
which consists of omitting the summation sign, summation over indices appearing 
twice being understood. We further agree to sum from 0 to 3 if the indices 
are Greek characters such as («, 8, ...) and from 1 to 3 if they are Roman ones, 
such as (2, k,...). Using this notation we can, for instance, replace 


3 
ds? = > qq Ix, dy (37.01) 
a B=0 
simply by 
ds? = g,, dz, dx, (37.02) 
or, if we wish to single out the time-like coordinate z, 
ds? = go, dxg + 29g, dx, dx, + giz dui dre (37.03) 


This short notation proves very convenient and does not lead to misunder- 
standings. In the rare cases when sunimation over a double index is not to be 
understood we shall state this explicitly. For instance, in the special case 
when (36.02) reduces to 
ds? = dx? — da? — dx — dag (37.04) 
we write Jug = adap (no summation) (37.05) 
In Section 20 we gave the definitions of a covariant and of a contravariant 
vector for the case (37.05). Equations (20.12) and (20.13) can now be written as 


Cx 
Ai = — A, 
a= Fei A, (37.06) 
ox’ 
and AS AP (37.07) 
OL, 


and they can also serve as definitions of a veetor in the general ease. As before, 
covariant vectors will be written as letters carrying a suffix while for contra- 
vanant ones a superfix is used. Towever, for the coordinate differentials we 
make an exception to this rule, writing them as dr, although they form a contra- 
variant vector. 

Thus a covariant vector can be defined as a set of four quantities that trans- 
form like the four partial derivatives of some function with respect te the 
coordinates, Sinularly, a contravariant vector is a set of four quantities that 
transform like the four coordinate differentials. 

Tn the case when ds? has the form (37.04) and when we consider only Lorentz 
transformations, the cocllicients in the transformation formulae (37.06) and 
(37.07) are constants. In the more general case of arbitrary transformations 
the coefficients are variable. In the case of Lorentz transformations a vector 
necd not necessarily refer to a definite point in space-time but may be “ free ”. 
An exainple of a free vector is the energy-momentum vector of a material 
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system. (Similarly the tensor of angular momentum and mass centre motion 
is afree tensor.) By contrast, if one considers general coordinate transformation, 
then, just as for ordinary curvilinear spatial coordinates, a vector must neces- 
sarily be related to a particular point in space. Examples of bound vectors are 
any field vectors, such as the velocity vector for a continuous medium, whose 
components are functions of a point, i.e. of the coordinates x», z,, x, and x3. 
However, a bound vector need not be defined throughout a finite region of space- 
time ; it may be defined only on some curve as is the case for the tangent vector 
of a curve, or on some surface as for the normal to a surface. The same remarks 
apply to tensors, and in general tensor analysis we shall always have to do with 
bound tensors as well as bound vectors. When these are transformed the 
quantities ar,/8x, entering the transformation must always be taken at the 
same point as the vector or tensor to be transformed.+ 

‘his definition of a tensor is analogous to the one given in Section 21 for the 
case of Lorentz transformations. Equations (21.01), (21.03) and (21.05) which 
give the transformation rules for second order covariant, contravariant and 
mixed tensors respectively remain valid in genera] tensor analysis. In our 
present notation they read : Y= an, Wy 


== (37.08) 
v , ? af r 
- dx, Ox, 
T'e = at ~ Te (37.09) 
@ Cc) 
an’ a 
and T's = at = Ts (37.10) 


Thus a covariant, contravariant or mixed tensor of the second order is a set of 
quantities transforming, respectively, according to (37.08), (37.09) or (37.10). 

Covariant or contravariant, but not mixed, tensors which have either of the 
properties of symmetry or antisymmetry, retain that property after trans- 
formation. For on renaming the suffixes in (37.08) we obtain 


a set (37.11) 
a ia ia 
Therefore, putting 28.g = Pap + Toa (37.12) 
and 2A ig = Tag — The (37.13) 
and defining S’,, and A’,, correspondingly we get 
Pa le (37.14) 
uw ea’ ax, 8 
vs \ 
On, OL, 
ee (37.15) 
and py en, ax, af 


The quantity S,, can be called the symmetric part and A,, the antisymmetric 
part of the tensor T,,. Equations (37.14) and (37.15) show that each of these 
two parts separately is a tensor. Therefore, if the tensor 7’,, is itself symmetric, 
so that A,, =0, then also A’, = 0, i.e. the transformed tensor T',, is also 


symmetric. Similarly, if 7, 18 antisymmetri 
S,, =0 and the transformed tensor T,,, 


+ For further discussion of free vectors see Section 49. 


c so that S,, = 0 then likewise 
remains antisymmetric. The same 
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considerations apply to a contravariant tensor, but for a mixed tensor the 
upper and the lower index enter the transformation equation (37.10) differently 
and therefore it has no invariant meaning to divide a mixed tensor into a 
symmetric and an antisymmetric part. 

An extremely important example of a symmetric tensor of second rank 
is the set of coefficients g,, in ds’. The fact that the g,, are symmetric in their 
two suffixes follows directly from their definition. That they form a tensor 
follows from the invariance of ds. For on passing to new variables 2), 21, 
x, and x, we have 


Guy I%,, AX, = Jug IL, dk, (37.16) 
: On, Oz 
whence Juv = Jap aa . Ba! (37.17) 


which is just the tensor transformation rule. The tensor g,, is called the 
fundamental or metric tensor. 
The set of quantities g* defined by the equation 


9° "Gay = 8S (37.18) 
also form a tensor, which in this case is contravariant. We proceed to prove 


this. In the primed coordinate system the gi, are given by (37.17) and the 
g'’ have to be calculated from the equation 


g''9;, = SY (37.19) 
On the other hand, if g*® is to be a contravariant tensor, we must have 
x, Ox; 
"pA — gif Uk "TA 
a Lares (37.20) 


We must show that both definitions of the g’"’ give the same. Since the solution 
of (37.19) for the g’*” must be unique it is sufficient to verify that the quantities 
given by (37.20) satisfy (37.19). This is simply done by use of the equations 


a a §% 
Ga, On, dn, °° wen) 
Ox, On 
and fe pais 
Dry 0a, Joo ox. (37.22) 


Inserting (37.20) into (37.19) and using first (37.22) and then (37.18) we obtain 
the sequence of equations 

Gees , Om, x, 
II, = Foy =* = = 979 


en), am, ox), Or, Ox Oa 
Ox, 0%,  * Be éx, Ox; ie dx, Ox, dz, Ox, ' 


which finally leads to (37.19). 

The g*° are called the contravariant components of the fundamental tensor, 
the g,, being its covariant components. 

The equation 


83 #3 = (37.24) 


which forms part of (37.23) shows that the set of quantities 


ge J if =v 
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which aré the game in all coordinate systems, satisfy the definition of a mixed 
tensor of second rank. 


Given a covariant vector A, the set of quantities 


AY = gh A (37.26) 
will transform as a contravariant vector, as is easily verified. We do not consider 
two vectors so related as essentially different and refer to the A, and the A" 
as the covariant and contravariant components of the same vector. The 
operation described by (37.26) is referred to briefly as raising the index of the 
vector A, and the inverse operation 


A, = 9,,A" (37.27) 
is called lowering the index. Raising and lowering of indices can also be per- 
formed on tensors. For instance, from the covariant tensor T,,, we can obtain 


the contravariant tensor 
ES Ge Gla. (37.28) 


by raising both indices, and the original tensor can be restored by the inverse 
operation 


Py = IuaIgl (37.29) 


It is evident that the properties of symmetry and antisymmetry are not 
affected by this. In forming a mixed tensor we must observe which of the 
indices is raised or lowered. For clarity we can mark the position of the raised 
or lowered index by a dot. For example, the tensors 


Th. = gT, (37.30) 
and Tage (37.31) 


are identical only if 7',, is symmetric, in which case the dots are unnecessary 
and we can write 
DY = Dy = GT a (37.32) 
For the sake of brevity we have hitherto discussed only vectors and second 
rank tensors, but tensors of higher rank may be defined correspondingly. 
A set of quantities is called a covariant tensor of rank n if it undergoes the 
transformation 


CL. OF Ou 
ay ao 


ot 
: = as oe Z 37.33 
Aa os... Bx Bee a ox, Ox, ox ( ) 


Py Be Bu 


Similarly a set of quantities is a contravariant tensor of rank n if it transforms 
as follows 


éx,, Ox, aux, 
BiP1b2-- Bx _. BMte---%n =—' ; Bo * == (37.34) 
Vay Ze, Oy 


Finally a mixed tensor of rank » with k covariant and m contravariant indices, 
where k + m =n, is a set of quantities transforming according to the rule 


da" Gx Or , oe 
Cites Hm. (%1 +++ mn BL ge og ae E>) 
Vy. YE By ...Bx Or Ox, ae a 
he 
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From a covariant tensor of rank n we obtain a contravariant tensor of the same 
rank by the relation 
Abi: En — An... ete grtn (37.36) 


and we can speak of these quantities as the covariant and contravariant com- 
ponents of one and the same tensor. If only some, but not all, indices are 
raised we can obtain mixed tensors of the same rank and, in general, we should 
indicate the position of the indices that are raised by the use of dots. If we 
have two tensors of rank & and m we can construct a tensor of the higher rank 
n =k-+m by multiplying components. For covariant tensors we may put 


Ay. 0, Bay yy... = Cay...n (37.37) 


and similar equations hold for contravariant and mixed tensors. 
j ir 
Givenatensor Ay 4, of rank n we can construct another tensor By. .a, 


by the rule 
Ae, ...n—2BY9PY = Bay. .oin—2 (37.38) 

The rank of the new tensor is two less than the rank of the original one. This 
operation is called contraction with respect to the two indices involved. In 
(37.38) the contraction is with respect to the last two indices and it is evident 
that the result of contraction will depend on which pair of indices is chosen. 
Contraction may be performed in two stages. First one of the suffixes is raised 
according to the rule 

ce Pe rr = Ay ; as (37.39) 
and then the other suffix, «,_,, is equated with the raised index B and the 
summation is performed 


7 oe B B (37.40) 
2... Oy 2B Oy ...8n_2 
Contracting a second rank tensor produces a scalar 
Tg” =T (37.41) 


which can also be given as 

PST (37.42) 
That the contraction of either of the two tensors (37.30) and (37.31) leads to 
the same result is easily understandable because the scalar 7 depends only 
on the symmetric part of the tensor T,,, and for a symmetric covariant tensor 
both the related mixed tensors are the same. 

A vector cannot be contracted because it has only one index, but from two 
vectors one can construct a second rank tensor which may then be contracted. 
The result is the scalar 

g’ AB, = A, BY = AB, (37.43) 
which can be called the scalar product of the two vectors A, and B,. If the 
two vectors coincide, the scalar is 


gA,A, = A,A* (37.44) 
According to the sign of the scalar product of a vector with itself, all vectors 
may be classified as being time-like, if 4A” > 0, space-like if A pA¥ <0, or 


nul vectors if A,A* =0. This classifieation 4 is the same as that discussed 4 in 
Section 20. 
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If two time-like vectors -A" and B* both have unit invariants the absolute 
value of their scalar product will be greater than unity. In other words the 
conditions 


IyyAvAY = 1; IuyB" BY = 1 (37.45) 
lead to the inequality |JuyA"BY| > 1 (37.46) 

This will now be proved. Condition (37.45) for A" can be written as 

0 1 t : _ [FotJox _ 
V(Joo)A® + Joi ——~ — giz) APAK = 1 (37.47) 
VIoo Joo 
: 1 
or more briefly as a (Ao)? — a4,AtdA* = 1 (37.48) 
00 
with ay, = Lor _ gy, (37.49) 
Joo 
bs 1 
Similarly BB = a (Bo)? — axBtBe = 1 (37.50) 
00 


On the other hand the scalar product A,B is equal to 
A,BY= ~ A,By — ay AtBe (37.51) 
00 


By virtue of the inequalities for the g,, proved in Section 35 the quantities 
a4, are the coefficients of a positive definite quadratic form. Therefore 


|anAtBk| < 4/(anAtA*) - +/(a1,BtB*) (37.52) 
and if we put A = 4/(axAtAF) ; B= V/(a1,BtB*) (37.53) 
we obtain | Ao| = VIovV(1 + A?*); | Bo| = VIov(1 + B?) 
|a1,A‘Bt| < AB (37.54) 
Therefore |4,BY| > /(1 + A*%)/(1 + B*) — AB > 1 (37.55) 
as required. 
For general time-like vectors it follows from (37.46) that 
|uvA"B| > A (G4"A”)-V(9,,,B"B") (37.56) 


and it should be noted that this relation is analogous to (37.52) except for a 
reversal of sign. This is due to the indefiniteness of the metric. 

For a time-like vector the covariant and the contravariant components of 
index zero always have the same sign. Assuming that A, >0 and B, > 0, 
and thus also 4° > 0 and B® > 0, we can write (37.56) in the form 


A,BY > 4/(A,A’)-4/(B*B,) (37.57) 


To conclude this section we discuss the concept of a pseudo-tensor in general 
tensor analysis. In Section 22 we introduced the set of quantities ¢,,/3 which are 
antisymmetric in all their suffixes with ¢o,.,; = 1. These quantities satisfy 
(22.02) which, in somewhat different notation, can be written as 


sips ee ey lh ge (37.58) 
pw 
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Here D is the Jacobian} (35.11). On the other hand we can easily show by use 
of the rule of determinant multiplication that 


Det g,, = D* Det g,, (37.59) 
This is a generalization of (35.18). Rewriting this equation in the form 
g = Dg’ (37.60) 
and taking the square root, after first changing the sign, we get 
V(—9) =|DIV(-9) (37.61) 


This is the transformation law for the determinant g. We now multiply both 
sides of (37.58) by +/(—g’) and put 
BE ee = 91 9) es (37.62) 
and Eegys = V(— 9)" Fapys (37.63) 
Then we can write (37.58) in the form 
, Ox), Ox, Ox, Ont = 


—* —§ _1 __- = spn D- E,,, (37.64) 

ane 0x, OL, OL, OL, in 
where sgn D = +1 is the sign of the Jacobian D. This equation shows that for 
transformations with positive Jacobian the quantities Z,., transform as a 


covariant fourth-rank tensor while for transformations with negative Jacobian 
their transformation rule differs only by a sign from the rule for such a tensor. 
A set of quantities with such a transformation law will be called a pseudo- 
tensor. The corresponding contravariant pseudo-tensor is obtained by the 
general formula 

Biever — 4/(— g)- gagr?grrg eas (37.65) 


: 1 
and is Revs = — —_____¢ (37.66) 


Vor aa 


The antisymmetric fourth-rank pseudo-tensor allows us to derive from any 
antisymmetric tensor A,, of second rank a dual pseudo-tensor 


* 1 
Aw’ — 5 EA. (37.67) 
and from any antisymmetric tensor A,,, of third rank a dual pseudo-vector 
* 
Ar 5 Er Aas (37.68) 


If we construct an antisymmetric tensor A,,, from the three vectors a, Bu Cy 
according to the rule 

Ans = a,b,c; + abseg + AsbnC,, _— a, bacs = AgbsC,, — a,b,c, (37.69) 
the dual pseudo-vector has the components 


a 1 414243 Xk 1 29 42 23 

o> — b, by bg}; Al= + by be bg 

WD) eee: A cet) ee 
(37.70) 

i, 1 2 4,43 A941 a, 

a= by by ba); $= +} by b, bg 

sae Pe ee VA Dien ae 


f In (22.02) D denoted the Jacobian of the inverse transformation. 
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These are proportional to the minors complementary to the elements bi Sie 
and &, of the determinant 


E5 ey Ee Es 
_ [40 41 42 G3 
A= bb obe Bs (37.71) 
Co Cy Cy Cy 
; * * * 
Obviously A*ta,=0; A%,=0; Ate, =0 (37.72) 


so that the pseudo-vector A* is perpendicular to all three vectors a,, 6, and 
c,. It may be called the vector product of these three vectors. 


38. The Equation of a Geodesic 


We consider two point-instants corresponding to two events in time sequence 
and we denote their coordinates by a) and 2!) respectively. Let a material 
point move along some curve in such a way that when a, = 2 its spatial 
coordinates are x, = af) and when x») = a) they are z= ae. 

As the events 2!” and z?) are assumed to be in time sequence such motion 
is possible with a speed less than that of light. The time a, and the spatial 
coordinates ; corresponding to it can be expressed parametrically in terms of 
an auxiliary variable p, by putting 


x, = 9*(p) (38.01) 

with a) = o%(p,); 2 = @%(p,) (38.02) 
Since the speed of the motion is less than that of light the inequality 

ds? = g,,0%6* dp? > 0 (38.03) 


must hold for any infinitesimal interval along the path. Here a dot denotes 
differentiation with respect to p. The finite interval between the two events 
in time sequence which is proportional to the interval of proper time ¢, will be 
denoted by s = ct and we have 
pe 
s = 07 = | VGagb"9") dp (38.04) 
Py; 

We now consider two quasi-simultaneous events. The two points in space 
at which:the events take place can be joined by some curve and to each point 
on this curve we can assign a definite instant of time, i.e. we can write down the 
‘‘ time equation ” for each point, taking care that any two intermediate space- 
instants are quasi-simultaneous. The analytic expressions for the curve and 
the time equation may again be stated im the form of equations (38.01) and 
(38.02), but we can no longer interpret these equations as describing the motion 
of a point along a curve ; they now give a static description of the curve as a 
whole. For any pair of intermediate points, infinitesimally separated, we have 

ds? == Iago?" adp?<0 (38.05) 
and the space-like interval 
rm . 
[= | +/( nee G4s0°P") : dp (38.06) 
a 
characterizes the length of the curve. 
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The question arises of the extremal values of both the time-like interval 
(38.04) between two events in time sequence and the space-like interval (38.06) 
between two quasi-simultaneous events. Both these variational problems lead 
to equations of the same form, whether the interval is time-like or space-like. 
The variational equations are called the equations of the geodesic by analogy 
with the theory of surfaces. However, it is important to note that whereas 
in the theory of surfaces, where the square of an infinitesimal distance is a 
positive definite quadratic form of the coordinate differentials, the geodesic 
is, generally speakingf, a shortest line; in the four-dimensional space-time 
manifold the situation is different : the extremal value of the interval is a 
maximum for a time-like interval and neither a maximum nor a minimum for 
a space-like interval. This can easily be verified in the special case of the 
Galilean metric where ds? has the form (37.04). For events in time sequence 
we can then choose a reference frame so that the spatial coordinates of the 
initial and final points are the same and we can choose the time ¢ as the para- 
meter. We then have 


(2) 
s= J a/c? — v?) - de (38.07) 
2) 
where 
dz\2 (/dy\2 (dz\3 
2 = — — — 
v= (2 (24) 8.08 


Thesolution of the variational problem in this case is given by constant values 
of z, y and z, so that v? =0. For any other trajectory v? will somewhere be 
greater than zero, so that »/(c? — v?) <c and therefore 


S< Snax = C(t) — £) (38.09) 

For the space-like interval we can choose a frame of reference such that 
02) = 4 YM Dy, 2) gD (38.10) 

while 2?) > 2), Taking the coordinate z as the parameter we obtain 


7'2) 


S Jb +(Z)'+ (=)'- o? (=) | - de (38.11) 


The solution of the variational problem is now given by constant values of 
y, 2 and ¢ for which then 
lexer = 20) — gl) (38.12) 
However, for other curves y(z), z(x) or other time equations ¢(z) we may find 
either 1 > 1,4, or 1 < 1,4, depending on whether the square root in (38.11) 
is in the mean greater or less than unity. 
We now derive the differential equations of the geodesic. The Lagrangian 
of the variational problem is 
L = Vu") (38.13) 
or, writing x, instead of o* 
L = V(Gaphqi%Xg) (38.14) 


f i.e. for sufficiently near terminal points. 
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The extremal condition for the integral 


Pe 
$= | Ldp (38.15) 
. Py 
leads to the Euler-Lagrange equations 
deb aL P aaa 
dp 08, 2a, ~ (38.16) 
We now put 
Me ‘ab 18 
F= 5 Jaataky (38.17) 
so that L=/(2F) (38.18) 
By the same reasoning as in Section 17 we can choose a parameter p so that 
aF 
—=0; F = const (38.19) 
dp 
and with this choice (38.16) is equivalent to 
OE aE 25 (38.20) 
dp 0%, x, 
These last equations possess the integral 
Le 2 — F= F = const (38.21) 
0%, 


so.that the condition (38.19) is a consequence of (38.21). Inserting the explicit 
expression for F we obtain from (38.20) 


a ee ee eee 
i Gaps) — 5 He" tatty = 0 (38.22) 
or, performing the differentiations, 

‘s 29.5. . 1 Gay. . 
Iapeg + 3, tpt, — 5 On, 70% = (38.23) 
The coefficient of ¢,4, can be symmetrized with respect to 8 and y ; if we put 

1 (a5, Gay “is 
ee (pl Mee Gem 38.24 
(bv a) = 5 (Set + Sev — 3 (38.24) 


the differential equation of the geodesic becomes 
Jagty + [By %] Zz, = 0 (38.25) 
The expression (38.24) is called a Christoffel symbol of the first kind. In 
order to solve equations (38.25) for the second derivatives we multiply them by 
g*’ and sum over «. Then with the new symbol 
{By v} = g™ (By, «] (38.26) 
we obtain %, + (By, v} Zz, = 0 (38.27) 
The expression (38.26) is called a Christoffel symbol of the second kind and is 
often represented by an alternative symbol 
{By, v} = Thy (38.28) 
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For uniformity we can also introduce a corresponding form of notation for 
Christoffel symbols +f the first kind : 

[o8, yy] = Ty 43 (38.29) 


but this is less common practice. 
We have thus 


Pca (= “he es) (38.30) 
2 \ ex, Cty Cx. 
l eg cg eg ) 
vo 1 ava f Yeo ed: 38.31 
and 4 5 g ( ar, an, an, ( ) 
In this notation the equation of a geodesic takes on the form 
az dx, dx. 
—~ +r, ~ =? =0 38.32 
dp? = af dp dp ( ) 


If the Christoffel symbols correspond to a metric tensor that can be written 
in the form (35.15) equations (38.32) are equivalent to the relations 


*_0 (k=0,1,2,3) (38.33) 


for the Galilean coordinates z,. This follows from the covariance of the equations 
and the fact that in Gallean coordinates the Christoffel symbols vanish. In 
this case, therefore, the equation of a geodesic leads to linear dependence of the 
Galilean coordinates on the parameter p. 

It is not difficult to verify that the development leading to (38.32) remains 
valid whatever the sign of F. If F >0 the “ geodesic”’ joins two events in 
time sequence and equations (38.32) can be interpreted as the equation of 
motion of a free mass point moving with a speed less than that of light. The 
increment dp of p is proportional to the increment dz of the proper time + 
and (38.32) may be replaced by 


ax, , at, dLz ie 
a ze Mea 7 0 (38.34) 


The length of the geodesic gives the interval of proper time between the 
“departure” and the “arrival”’ of the mass point. If on the other hand 
F <0, the geodesic joins two quasi-simultaneous events and we can put dp 
equal to the increment of the spatial interval. Equations (38.32) then appear as 
oF , at, dt, 0 38.35 
qe 1°38 ar at (38.35) 

The case F = 0 corresponds to a point moving along a ray with the speed 
of light. In this case the Lagrangian (38.18) is zero and the above derivation 
of the geodesic equation is no longer valid. However, the equations (38.32) 
themselves retain their meaning and as they possess the integral (38.21) they 
are compatible with the condition F =0. To justify the equations in this 
case we can start from the Hamiltonian equations that were discussed in 
Section 36. According to (36.08) we have 

dz, 6H dwt oH 


RS Seay ap Ske Boe eas he a ok — 9: QQ « 
ti, OG aes Ba (k = 1. 2, 3) (38.36) 
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where the Hamiltonian H = —a, is obtained by solving for w, the equation 
G = g"w,a, = 0 (38.37) 

Therefore we have 

1 og oG 
dH = dey Ss Se ena 
“0 (8G [Be09) (= no Bee do) eo 
Using the fact that 

dw, dH oH 98,39 
de, ~ dey Oy ae 


and expressing the derivatives of H in terms of the derivatives of G we can 
write equations (38.36) in a symmetric fashion : 

dx, 1 0G. dw, 1 0G 

bia eo oR CHU (38.40) 
Here dp is considered to be the differential of the independent variable p. 
The first four equations of (38.40) have already been given in Section 36. 
Writing the right-hand sides explicitly we get 


dx dw 1 age 
ee 38.41 
dp gree dp 2 0z,, uy ( ) 
It is readily seen that these equations are equivalent to (38.32), for we have 
On = Iya ap (38.42) 
ag’ og? d 09, &: 
and therefore eS ey " a — — guy Sur (38.43) 
ox, “ d2,~°™ dp 0x, ap 
‘ ag’ 9 1n 7) i) 
i py VY RA LT (ey = — (s*)=—0 44 
since 3, 9 +9 Se = By (9°90) 3a, (33) (38.44) 
Inserting (38.43) into (38.41) we get 
1 ag, 
ete Dey Gan Mr (38.45) 
dp 2 dx, ap 
or, in consequence of the first set of equations in (38.41), 
COy= 1 a ey (38.46) 


‘dp 2 02, dp dp 
Eliminating the w, from these equations and (38.42) we finally obtain 
4 (,, 8) —} Ma tt oe 
‘ip dp 2 dx, dp dp 
These equations are the same as the equations (38.22) from which the equations 
of the geodesic in the form (38.32) were derived. The passage from (38.41) 
to (38.47) is the usual one from Hamilton’s to Lagrange’s equations. 

We have thus proved that the geodesic of zero length is likewise determined 
by the equations (38.22) but with the condition F = 0 adjoined. 

It should be noted that because F is constant, a geodesic retains its character for 
its entire length; it may always describe the motion of a point withaspeed less than 
that of light or it may be a null-line or, finally, it may be everywhere space-like. 

For a null-geodesic the relation (38.37) with w, = @w/0x, may be con- 
sidered to be the Hamilton-Jacobi equation for the action function w. (See 
Section 36.) The Hamilton-Jacobi equation for the general case can also be 
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readily obtained. For definiteness we consider the case of a point moving with 
a speed less than that of light. a 

Choosing the time t = 2, as the parameter and denoting differentiation 
with respect to it by a dot we can write the Lagrangian of the problemt in 
the form 


L= + V(Goo + 2Gostt + Yixdite) (38.48) 
The generalized momenta are 
aL 1 . 
— = p= + = (Got + Gute) (38.49) 
OLE L 


and the Hamiltonian is found by the usual rule to be the expression 
1 
H=igp, —-L= — Zr (Joo + Joxtx) (38.50) 


with the velocities z, expressed in terms of the momenta p, by (38.49). 
If we put 


1 
Po = F (Yoo + gortk) (38.51) 
and observe that Ldt = ds (38.52) 
where s is the length of arc, the four quantities p1, p, can be uniformly written as 
dz 
Pa = Iap 7 (38.53) 
dz, d 
The identity 909 a2 ae = (38.54) 
leads to the relation 
gy” Pp, = 1 (38.55) 


which can be regarded as the result of eliminating the three velocities 2, Z, 
and #, from the four equations (38.49) and (38.51). The Hamiltonian H = —p, 
is obtained by solving (38.55) for p). The Hamilton-Jacobi equation is obtained 
by the usual rule of expressing 7,, p., p, and H as partial derivatives of S with 
respect to the spatial coordinates and to time, as follows: 
as as 
Sri ee 38.56 
H a? | PE (38.56) 
These equations can also be written as 


as 


Py = 5 (38.57) 
Thus the Hamilton-Jacobi form of the equation of a geodesic is 

as aS ; rae 

Iu aa, ba (38.58) 


If a complete integral of the Hamilton-Jacobi equation 
S = 8 (xo, 21, Lg, Lg, Cy, Cg, C3) + Cg (38.59) 


} It is convenient to introduce the Lagrangian with opposite sign to the usual convention 
in mechanics, As a result, the sign of the energy will be opposite to the sign of the Hamil- 
tonian. 
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is known, which contains three arbitrary constants c,, c, and cs, not counting 
the additive constant cy, the derivatives of S with respect to the constants, 


—=b, (k=1,2,3) (38.60) 


are also constants, as is proved in mechanics. They are determined from the 
conditions of the problem. 

A comparison of (38.58) with (38.37) shows that the equations of a null- 
geodesic are obtained from (38.58) by replacing the right-hand side by zero. 
For a space-like geodesic the right-hand side of the Hamilton-Jacobi equation 
is a negative constant which can be set equal to —1. 


39. Parallel Transport of a Vector 


In Euclidean space it is very easy to define what is meant by saying that 
two vectors at different points are equal and parallel. It simply means that their 
cartesian components are equal. Obviously this same definition holds) for 
vectors in a plane, and it can immediately be generalized to the case of vectors 
in a curved surface, provided the surface is developable. However, for a general 
non-developable surface parallelism of two vectors in it can be defined only if 
the points of application of the two vectors are infinitely close to each other. 
We can regard a vector on a surface as being a vector in space which is tangential 
to the surface at its point of application. Given a vector on the surface at a 
point P a vector at the adjacent point @ of the surface and parallel to the 
first vector in the sense of the geometry of the surface can be constructed as 
follows. We consider the given vector at P as a vector in space and we con- 
struct at Q the vector that is parallel to it in space in the usual sense. We then 
project this vector on to the tangent plane to the surface at Q. The tangential 
vector at Q is taken to be parallel to the given vector at P. 

This construction can be performed analytically as follows. Let y,, y, and 
ys be cartesian coordinates in Euclidean space and xz, and z, the coordinate 
parameters on the surface. Then the parametric equations of the surface have 
the form 


Y1 = YilLy, Le) Yo = Yo(Lr, Ze) 5 Ys = Ya(Xy, Ze) (39.01) 


and the square of the element of arc on the surface is 


ds® = 91, day + 2912 da, dx, + Joo dx, (39.02) 
3. Oyn OYn 
where Ji = Bey Bx, (39.03) 
n=1 


Let A, and A, Le the covariant components of some vector at the point 
P(x, £5) on the surface and A} and A? its contravariant components. We can 
look upon this vector as a vector in space with the cartesian components 


Yt pe gt 1B) (39.04) 
OL, OL 
3 Cn 
and we have = Vos] l=1,2 (39.05) 
At > = oxy ( ) 


n=1 
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If on passing to the point Q(z, + dx, %, + dz.) we do not change the cartesian 
components Y,, we get a vector in space which is no longer tangential to the 
surface. However, its tangential components define a vector on the surface, 


namely 


Ox) a= OL, OL 


3 9 2. 92 
Ae > ¥n( a ve Bae] (39.06) 
n=1 


and this vector is taken by definition to be the result of parallel transport of 
A; to the point Q, in the sense of the geometry on the surface. The normal 
component of Y, evidently drops out of equation (39.06). 

The expression that appears as an addition to dyn/0z; in that equation takes 
account of the change of this quantity in going from P to Q. Owing to this 
additional term the component A; receives the increment 


3 2 By n 
= Y 8 39.07 
84) 2, n ray Ox, OL] Like ( ) 


Insertion of Y, from (39.04) gives 


Cyn "yn 


(39.08) 
Ox; OLE OX, 


2 3 
dA) => Aga, 
It is readily shown using the expression (39.03) for the gy that the sum over 
n in (39.09) is 


3 
Yn Yn ( ik Gil zy (39.09) 


Ls Oa; Oxy Ox, 2\ Om | Bey Oxy 


or, using the definition (39.30) for the Christoffel symbols, 


3 
Cyn OYn 
me ee 
pa ene (39.10) 
Thus the increraent of the components of any vector in parallel transport is 
2 
A, = > T3,4.4 8xy (39.11) 
1,k=1 


It is important to note that this increment depends cnly on the internal pro- 
perties of the surface, which are determined by the expression (39.02) for ds?. 
The theory of the parallel transport of vectors was developed by Levi-Civita 
[14] and his pupils ; it can be formulated almost without change for the case 
of the four-dimensional space-time manifold. 
Let the coefficients of the quadratic form 


ds? = 9, dz, dx, 39 19 
be represented as . P (39.12) 
Ny 
ae Oyn OYn 
Jan = 2 on Big (39.13) 
N= fod ea 


where the numbers €n are +1 and the 


Yn = Yn(Zo, Z, Ly, Ls) (39.14) 
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are some functions. In the ordinary Theory of Relativity the g,, are obtained 
from the quadratic form (35.09) and can be represented by (35.15) ; this cor- 
responds to N = 4. In the most general case we have ten g,, and to represent 
them in the form (39.13) not more than ten functions yy are required. (This 
can be rigorously proved.) It should be noted that since the signature of the 
quadratic form (39.12) is (+ — — —) there must be at least one positive 
and at least three negative én. 

We can formally interpret the yn as cartesian coordinates in some many- 
dimensional pseudo-Euclidean space in which the metric is given by 

N 
dn2= Se, dy? (39.15) 

Space-time is then a certain hypersurface in this many-dimensional space. 

To an ordinary contravariant vector A“ in space-time there will correspond 
in the many-dimensional space a vector tangential to the hypersurface and 
having the cartesian components 
OYn 
tes OL, 


(Here and in the following summation from 0 to 3 is assumed for Greek indices.) 
Using (39.13) we thus obtain the following expression for the covariant vector 
components A, : 


At (39.16) 


N 
fay 
Hos > enYn TE (39.17) 
n= 


In analogy with (39.06) we can define the values of the vector components 
after parallel transport to an adjacent point by 
N 


OYn O°Yn 
A Ae Yn2(— + ———- 4 39.18 
a + 34, Ze ‘ (= v dx,0Lg “B ( ) 
N 
22yn (39.19) 
whence 5A, = 2 enYn i, Om, dX, 
and, on insertion of the expressions (39.16) for Yn, 
N 2, 
a > Lp La vee (39.20) 
4 Or, Ox, Ox, - 
n=1 ¥ an 
Now, by analogy with (39.10), it follows from (39.13) that 
N 2 ie 
Cyn Cyn 
on —— =f 39.21 
> Cn ax, 0x, OX, ¥, «8B ( ) 
where the I’, are the ordinary Christoffel symbols 
rn. 1 (= aie Jory “Os (39.22) 
Eh De Oia ox, on, 


Thus the formula for the increment of vector components due to parallel 
transport has the form 
8A, = TV, At Baty (39.23) 


just as for an ordinary surface in Euclidean space. 
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Equation (39.23) contains both covariant and contravariant vector com- 
ponents, but it is simple to state it entirely in terms of one or the other kind 


of components. We have 


At =g"A, (39.24) 
and ; 
Ty, op = Ves (39.25) 

so that 
34, = 1%, 4, 3% (39.26) 
Only covariant components enter these equations. On the other hand we have 

a 
3A, = Gq, 8A" + AY oa S08 = Ty, ag? Bg (39.27) 
and it is easily verified that 
ag 

Ty. +0 aor Be (39.28) 

Hence 
Jay SAT = — Ty, gAY 82g (39.29) 
and therefore in terms of contravariant components the required equation is 
SAY = —I%,A* 82, (39.30) 


We now consider the change in the scalar product of two vectors in parallel 
transport. We have 
§(A°B,) =B, 8A” + A* 8B, (39.31) 
We insert the expression for 8A” given by (39.30) and write 8B, according 
to (39.26) as 
SB, = Tx, B, 8%, (39.32) 
It is then evident that the two terms on the right-hand side of (39.31) cancel, 
giving 
3(A°B,) = 0 (39.33) 
Thus the scalar product of two vectors remains unchanged in parallel transport. 
In particular, the magnitude of a vector is unaltered. 

We have hitherto considered infinitesimal displacements, but by adding 
up such displacements we can define the parallel transport along any given 
‘curve. We assume that the coordinates of a point on a curve are given as 
functions of some parameter p : 

Ig = 24(p) (39.34) 
The Py, being functions of the coordinates, will then also be known functions 
of p. To obtain the vector A” as a function of p we have to solve the differential 
equations 

dA’ as iw fe day 
dp we dp 
If the values of A’ are given at the initial point of the curve we can integrate 
(39.35) and so obtain the values of the A” at the final point of the curve. In 
this way we perform the parallel transport of any vector from the initial to the 


final point of the curve. Evidently the result depends on the nature of the 
curve along which the parallel transport is made. 


(39.35) 
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We compare the equations (39.35) with the previously given equations for 
a geodesic 


as es a) el Fae el ae (38.27) 


eat, (39.36) 


If the geodesic is time-like, so that it describes a point moving with a speed 
less than that of light, the proper time + can be used as the parameter p and the 
vector AY will be the four-dimensional velocity. Thus in this case the equation 
of a geodesic can be interpreted (in four-dimensional terms) as the equation for 
parallel transport of the velocity vector in the direction given by this vector itself. 
The equations for the parallel transport of a tensor of any rank are easily 
obtained from those for a vector. As an example we consider the case of a 
covariant second rank tensor T,,. We start from the requirement that the 
invariant 
I=T,, AXB (39.37) 
should not change in parallel transport, whatever the vectors A* and B’. 
With some relabelling of indices we can write SJ as 


SI = A“B’ (8T,,, — 7,1 % 80 — TU %, Sz) (39.38) 


wa vB 
Since this expression must vanish for arbitrary A* and BY we must have 


87, = TVS 8g + 1%, Sag (39.39) 


which is the required generalization of the equation of parallel transport. 


40. Covariant Differentiation 

In the case when the g,,, were constants it was possible to treat the operation 
of differentiation as a symbolic multiplication by a vector; if, for instance, 
A, is a vector given as a point function in some region then, for constant g,,,, 
the expression VA, = 0A,/dz, is a tensor with the same transformation 
properties as the product of the vectors V, and A,. Consequently in this 
special case tensor analysis and tensor algebra are formally identical. 

For variable g,, this is not so ; the derivatives of a vector with respect to 
the coordinates no longer form a tensor. However, it is still possible to construct 
certain linear combinations of the denvatives of vector components and of 
the components themselves which transform as a tensor. 

We consider a vector field A, defined in some region of space. The vector 
components are then functions of the coordinates. The change of the vector 
in going from a point P(z,) to an adjacent point Q(z, + Sz,) is 

aA 


(A,)g oa (A,)p = 5,4, — ros SZ, (40.01) 
B 


However, we can also compare the values (A,)9 of the components A, at Q 
not with their values at P but with the result (4,)9 of their parallel transport 
from P to Q. According to (39.26) the change of a vector in parallel transport is 


(A,)o —(A,)p = 8.4, = 14,482, (40.02) 
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Subtraction of this expression from the previous one to obtain the difference 


(Ay)g — (A,)o = 5A, — §,A, == §,A, (40.03) 
0A, 
results in dA, = ( Bi, aia ‘| 32g (40.04) 


The quantity 8A, gives the difference between the actual change of the vector 
and the change it would experience in parallel transport. It is at the same time 
the difference of two vectors both referring to the same point Q. Therefore 
the $4, form a vector. For arbitrary da, this can only be true if the quantities 
aA 
V4, = a AIA. (40.05) 
form a covariant tensor of the second rank.+ It is called the tensor derivative 
or covariant derivative of the vector and is the required generalization to 
variable g,, of the ordinary derivative of a vector. 
By analogous considerations we can derive an expression for the covariant 
derivative of a contravariant vector. It has the form 
aA 
OL, 
The formulae of covariant differentiation are easily generalized to apply to 
arbitrary tensors. We consider first the second rank tensor T,,,. According to 
(39.39) parallel transport from P to Q changes its components by 
807, = (M%gT,, +147 ,,,) 8%, (40.07) 
Taking the T,,, to be functions of the coordinates their change in going from 
P to Q is 


V4" = — +1 %,A* (40.06) 


aT, 
7, = Bin zg (40.08) 
The difference ST, = 5:7, — 5,7 ,, (40.09) 
oT 
becomes sT = ( Beg —e,T., — Pu) dz, (40.10) 
and is a tensor for arbitrary displacements dzg. Therefore, the quantity 
aT, 
Voli = Gye — Thal — Male (40.11) 


must be a tensor. 
Similarly one proves the tensor character of 


V7 a 
8 = Be, + Py,rev + Viel? (40.12) 
where 7” is a contravariant tensor and of 
V Tu. —— ory e Tu 
oly = a Tae + Tyre (40.13) 


where i ie is a mixed tensor. 


| ft The fensol character of (40.05) can also be proved without use of the notion of parallel 
eee To oe oe one starts from (37.06) and proves the transformation law (42.04) for 
varistoltel sytabols. One then transforms (40.05) to i i 
formation laws (42.04) and (37.06) pen eee vite Patras 
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By applying the rules of covariant differentiation to either the covariant 
or the contravariant components of the metric tensor expressions (40.11) and 
(40.12) can be shown to vanish. This fact is known as Ricci’s lemma. Because 
of its importance we shall consider its*formulation in some detail. 

If we put T,,, =g,, equation (40.11) gives 


Ay 
Yate = Be alts — ly =0 ao. 
for, written in the fo Guy =T—T T 40.15 
) e form Big oT a ae Se (40.15) 


this equation is an obvious consequence of the definition (39.22) of the Py op 
and was already stated in (39.28). 
If we put T+’ = g*’ in (40.12) we get 


ag" 
ex 


+ gl, + I, = 0 (40.16) 
8 


Vag" = 


This can be verified by using the explicit expressions (38.31) for the I' ya Lhen 
(40.16) reduces to 


agi’ 0g 
+ gregqre eo __ (40.17) 
CL, dL, 


and this is equivalent to 


og ag a 
—_—— HV Bd — 
Dis 2, + 9 


Mg) = 0 40.18 
ae Tag FO (40.18) 


Finally, if T+ = $4, the expression (40.13) reduces to zero by virtue of the 
symmetry of the I’y, with respect to their lower indices. 

Covariant differentiation of a product of two tensors obeys the same rules 
as ordinary differentiation. This follows directly from our method of deriving 
the expression for a covariant derivative, for the rule was obtained from the 
expression for an infinitesimal increment for which the ordinary rule for 
differentiating a product is valid. 

Let us verify the product rule in the differentiation of the product of two 
vectors. Inserting 
(40.19) 


into (40.11) we get 


aU av 
v,(U,V,) = (= — re5U,) V+ v,( y A (40.20) 
a 


and, using (40.05), 
V,(U,.V,) = (VeU,,) - V, + OV, V,) (40.21) 
Applying the product rule to the expression 
U., =GuU” (40.22) 
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and using the fact that the covariant derivatives of the metric tensor vanish, 
we obtain 


VU, = IuvVal™ (40.23) 


Thus we see that in covariant differentiation the g,, behave as constants so 
that they can be taken outside the differentiation symbol. In other words 
it is immaterial whether indices are raised (or lowered) before or after covariant 
differentiation. 

We have just stated explicitly the expression for the covariant derivatives 
of a vector and of a second rank tensor. The covariant derivative of a scalar 
is the same as the ordinary derivative 


V0 = — (40.24) 


This quantity is, as we know, a covariant vector. 
For completeness we write down the general expression for the covariant 
derivative of a tensor of arbitrary rank 


UM — year tm (40.25) 


(v) Vere VE 


which has m contravariant and & covariant indices. We have 


@ ? ww 1 77PM2 «+ Lm Hm 7b «++ Pm—18 
VU) = ag, Upp) + TegUR tm tf epee tm 


(v) (v) eB ~ (vy) 
—_T? (2) = _ TP (4) 
Vive Uv aug et Vou, UY aii (40.26) 


In each term of this sum one index of the tensor being differentiated becomes 
an index attached in the same position to the T-symbol, while in its place the 
tensor itself carries a summation index which is repeated in the opposite position 
as one of the indices of the accompanying I-symbol. One of the lower indices 
of I is always the label of the coordinate with respect to which one is differ- 
entiating. The terms in which a change in a superfix of the tensor is made enter 
with a plus sign and those in which a suffix is involved have a minus. 
Sometimes it is convenient to use a special notation for the operation of 
covariant differentiation combined with a raising of the suffix. The operation 


Ve — g°V a (40.27) 
may be called contravariant differentiation, 
41. Examples of Covariant Differentiation 


We shall now ap 


ply to some special cases the rul i i iati 
yen ee rules of covariant differentiation 


First we calculate the divergence of a vector. By (40.06) the expression 


8a 
‘A> dz, + Tend® (41.01) 
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is @ mixed second rank tensor. Contraction with respect to p. and v gives the 
scalar 
; . oA’ 
Div A = VAY = ap +Tv At (41.02) 


v 


This is a generalization of the expression (21.24) for the four-dimensional 
divergence. It can be simplified as follows. We have 


1/09 og 9g 1 ag 
Ty, = gu, -( pa va) L quy “Suv 
Se af ox, Ye a) 2° aa ere?) 


v a 


and if g is the determinant formed from the g,,, and if in it gg” is the minor of 
9,» then the rule for differentiating a determinant gives 


dg = 99°*d9.,, (41.04) 

1 ag og 
whence -—=g — 41.05 
g Ox, - OL, ( ) 


We note that by (40.18) we also have 


1 a ag’ 
Goa Gi me (41.06) 
1 a 
and therefore | bee Bq = = = lg \/(—9) (41.07) 
Insertion into (41.02) gives ; 
1 7) 
VAY = — {V(-9): 4” 41.08 


We see that the divergence of a vector A multiplied by «/(—g) is equal to the 
sum of partial derivatives with respect to the coordinates of «/(—g) times the 
contravariant vector components. 

If for our vector we take the gradient of some scalar 9, putting 


ra] 
= (41.09) 
0x, 
ap 
vi guy 41.10 
so that A g 2, ( ) 


the divergence of this vector is the invariant expression 


1 a eo 
= (Vi-a) 9" =] (41.11) 
ee /(—-9) e, ( ox, 

This is the generalization of the d’Alembertian (21.27). 

The same divergence may also be calculated by a different method. We 
form first the covariant derivative of the gradient of ». By the general rule 
(40.05) we get 


2 
pepe a ie ao (41.12) 
ww 8x, Ox, Y 0x, 


144 The Theory of Space Time and Gravitation 


and the expression is symmetric in » and v. It can be called the covariant 
second derivative of the scalar 9. If we then form the invariant 


Oe = 9 Guy (41.13) 
d*@ eg 
i = gi — Te — 41.14 
we obtain Oe = g" Be, Oa, r Ta, ( ) 
where we have put for brevity [* = g¥I%, (41.15) 


Since both (41.11) and (41.12) give the divergence of one and the same vector 
they must be equal. It is immediately obvious that the coefficients of the 
second derivatives of @ are equal, but equating the coefficients of the first deri- 
vatives leads to the identity 
at ola in SE i ahs 41.16 
D 5) aa, (V' 9) 9°} (41.16) 
This identity could also have been verified directly. 

We note that in a system of coordinates in which wo, z,, x, and x, are four 
solutions of the equation (lp = 0, the I“ will vanish. Coordinates of this kind, 
which in addition satisfy certain conditions at infinity to be discussed in a 
later chapter, are called harmonic. In ordinary Relativity Theory cartesian 
spatial coordinates together with time are harmonic. 

If a vector A, is the gradient of some scalar the difference of covariant 
derivatives V,A, — VA, vanishes. In general this difference does not vanish 
and can be considered to be the four-dimensional generalization of the curl 
or vorticity of the vector field. We putt 


(Rot 4),, = V,4, — V,A, (41.17) 


By its definition this is an antisymmetric covariant tensor of second rank. 
Using equation (40.05) we can easily see that in the difference (41.17) the terms 
characteristic of covariant differentiation cancel and we have 
o 
(Rot A) A, os 


uv 
ox, Ox, 


(41.18) 


which is the same as the usual expression for vorticity. The latter is thus valid 
in arbitrary coordinates. 


Let us now write A,,, for the symmetric part of the covariant derivative of 
the vector 4. We have 


1 
Ay = 5 (VA, + ¥,A,) (41.19) 
1/24, aA 
or, using (40.05) Hi 230 ( u =| js 
alge ae | ee (41.20) 


With the aid of the symmetric tensor A, we can define the covariant second 
derivative of the vector A, by 
V4, = VA,, + VA, — VA, (41.21) 


Love 
t The symbol Rot will be used for the fo 


ur-dimensiona] icity; i 
retained for the three-dimensiona] case, Seen eae eels 
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Using (40.11) to express the derivatives on the right-hand side we get 


a a 
pv*7o an, en, ore Aj, + we Od — ax, (eA) = ~ (eA) 

we, 
(41.22) 
Similarly the covariant second derivative of a contravariant vector is given by 

7A oA* 0A oAe ale 

V Ae = ° [Te Oe A ee et _ BY Aa 

ey Gx, Ox, ie x, +N ex, Pay ax, v az, e oe 


We now consider the divergence of a second rank tensor, which we shall 
write in contravariant form. According to (40.12) we have 


eT 
VT = + Tyr (41.24) 


Transforming the third term with the aid of (41.07) we can write 


1 7) 
uv __ pay a y ed 
Vie Vea) in/(—9)- 2) + PT (41.25) 
Putting T*” = g*’ and remembering that the covariant derivatives of the metric 
tensor vanish we again get the identity (41.16). If T¥’ is antisymmetric the 
last term in (41.25) vanishes, so that the divergence of an antisymmetric tensor 
is like the divergence of a vector in being a sum of derivative terms. For a 
general tensor, on the other hand, such a representation does not exist. 

We now write down the expression for a covariant derivative of a covariant 
second rank tensor together with two expressions obtained from the first one 
by cyclic permutation of suffixes : 


oF, 
V —TeF, —Te 
Fy, = Ht — Tho F yy — TF 


o~ pv vo" Lp 
OF 
Vie = 3a, ee a a (41.26) 
OF 
VP ois => ae a i i Del as 
Assuming that F,,, is antisymmetric, Le. 
| ates i (41.27) 
find that V.F,, + V.F,, + VFou — Fy) Oe | Aon 41.28) 
we hn a Pers Fae: — Ga, + On, 7 On, ( 


because the terms not involving derivatives cancel in pairs. The expression 


oF 
pw Fw, Og | Hon (41.29) 


HNO OL, OX, on, 


which is antisymmetric in all three suffixes is therefore a tensor of third rank 
which may be called the cyclic derivative of the antisymmetric tensor F,,, 
We have previously encountered such an expression in Section 24. 
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The cyclic derivative of any antisymmetric tensor is related to the divergence 
of the antisymmetric pseudo-tensor dual to it. As in (37.67) we introduce the 
dual pseudo-tensor by the relation 


1 
i? = 5 EF s (41.30) 
so that ; 
* 
V(—9)- FY = Fig3 -V(—9)- PF = Fas V(—-9) P= Pi (41.31) 
* * é 
V9) F8 = By; V(—g)-P = Fae V(—9): P= Pro 
We also introduce the pseudo-vector dual to F,,,, a8 in (37.68) : 
x 1 
fe = 6 Bree Pg (41.32) 
so that 
* 
_9)-F° = —Fyo3; 
ve = . (41.88) 


* * 
V(—9) F? = Fos; V(—9):F? = Fao; /(—9) FF? = Fieo 
If F,,,, is the cyclic derivative of F',, we have 


1 atvi-)-F} 
V(—9) Og 


Le. the pseudo-vector F* is the divergence of the pseudo-tensor Fe, 

The same result is easily obtained without considering individual com- 
ponents by using the fact that the covariant derivatives of H°°v* and also of 
Eo) calculated by the general rule, vanish identically. 


(41.34) 


42. The Transformation Law for Christoffel Symbols and the Locally 
Geodesic Coordinate System. Conditions for transforming ds? to 
a Form with Constant Coefficients 
Covariant, or tensorial, derivatives differ from ordinary derivatives by 
terms involving the Christoffel symbols 


nist wv (Gan , Gor Yap 
M59 (= + fe 3) (42.01) 
If at some point z, = 2° all the Christoffel symbols vanish, the expressions for 
tensorial and ordinary derivatives are the same. We shall show that in the 
neighbourhood of any point it is possible to introduce a system of coordinates 
such that all the Py, vanish at the point. Then, by (40.14) and (40.16) all 
derivatives of the metric tensor with respect to coordinates will vanish at the 
point in question. 

We first establish the law according to which the Christoffel symbols trans- 
form when one passes from the original system of coordinates (pe Bis, Dax Bs) 
to some new system (x, ,, 2, £4). This law could be derived directly from the 
definition (42.01) of the Py, using also the transformation law for the metric 
tensor. It is, however, simpler to argue as follows. We know that the quantities 
an a 
ae ae (42.02) 


Puy = 


[2 


General Tensor Analysis 147 


represent a tensor. This means that for any function » and any coordinate 
transformation the equations 
2, 2 t , 
1, = (Ste B) SSF 4203) 
én, az, “dx, \éa ex, * ™ Ox3) ox, ox, 
are valid where the (I'$,)’ are Christoffel symbols in the primed coordinate 
system. Putting p = x, we get 
O*g = Te Ora. — (i 5 ea 
ax, Oa, ” dz, a8" Ox 1, OL, 
This is the required transformation law. The presence of the term involving 
the second derivative shows that Ig, is not a tensor. If the transformation 
under consideration is linear the term mentioned is absent and, therefore, for 
linear transformations the I'¢, behave like a tensor. 
Let the values of the [4 atthe given point be (I?,),. The primed quantities (T'2,)’ 
will vanish at the point if the transformation of coordinates satisfies the relations 


(42.04) 


O22" ax’ 
( 2 ) —(T*), (=) AG (42.05) 
Ox, ox, 0 i ox, 0 

These relations will be satisfied if we put 
, 1 . 
tz = 2, — n i 3 (Tav)o (x, = a) (x, x) (42.06) 

For this transformation 
ax’ 

LS) == $9 42.07 
(=) =2 (42.07) 


(-) 
and, therefore, the values at the point of the components of any tensor will be 
the same in the primed and the unprimed system. In particular there is no 
change in the components of the metric tensor ; the derivatives of this tensor 
on the other hand all transform to zero. This fact may be utilized to simplify 
calculations involving tensors, for it follows that if some quantity is known 
to be a tensor and also known to vanish if a9,,/@z, =0 the quantity must 
also vanish without that condition. 

It can be proved that with a suitable choice of coordinate system the deriva- 
tives of the g,, may be made to vanish not only at one point, but even along 
any given line [14]. 

A coordinate system in which the derivatives of the g,, vanish at a given 
point is called locally geodesic. This name is justified by the fact that in such 
a system the equations of a geodesic reduce at the point in question to the 
statement that the second derivatives of the coordinates with respect to the 
parameter p vanish. (In the vicinity of the point these derivatives will be 
first order infinitesimals.) Hence the coordinates there will be linear functions 
of p up to terms of third order. 

We now ask the question : What is the condition for the existence of a co- 
ordinate system (z,, z;, 2,, Z,) in which the Christoffel symbols vanish not only 
at the given point or along some line, but even in some finite region? 

If such a coordinate system exists the equations 

OP a eG (42.08) 
dx, Ox, “ G2, 


148 The Theory of Space Time and Gravitation 


must have a solution, because they are satisfied by the functions 
=%> 9=%; =i P=% (42.09) 


For all the equations (42.08) to be compatible it is obviously necessary that 
expressions for the same third derivative calculated from one or other of the 


equations must coincide. We have 


f(y 
Ox, ox, Ox, éx,\ * ox, 


zie) ~ (a) 
Ox,\ OL, OL, Ox, ox, 


Here the left-hand sides are equal, therefore also the right-hand sides. We 
equate the latter, perform the differentiations and express the second derivatives 
in terms of the first, getting 


(42.10) 


ale ole eg 
Aw Sue i pepe pepe | 9P _ 9 (42.11) 
(= dx, 1 Mw! ox Bee Oe 

These equations must be valid for ¢ = z,, 9p =2,, p =z, and 9 = 2,. Since 
the determinant 


D = D (2p, H, Lo, 23) (42.12) 
D (So, £1, Lp, £3) 
is not zero all the coefficients of the 6p/éx, in (42.11) must vanish. These 
coefficients are 


ae, = are 
Bh te ee ae ofTe _ [se [Te 42.13 
Ri, va Ox ex, at; bee es ov ( ) 
We now prove that the conditions 
Re =0 (42.14) 


are not only necessary but also sufficient to ensure that the system of equations 
(42.08) has a solution. To do this we put 


op 
es 42.15 
Py 2a, ( ) 
} eg, 
and write (42.08) as oS Fe (42.16) 
u 


Let the values of the 9, be given at some point which has the coordinates ue 
To get the values for an arbitrary point x, we join the two points by some curve 

m, = E (p) (42.17) 
where p is a parameter, and we consider ?,, and also Te, as functions of p. The 
, are then determined by the system of ordinary differential equations 


dg, 
dp Tine"? (42.18) 


where the dot denotes differentiation wi 
This system uniquely determines the 
curve. It remains to show that the values 


th respect to the parameter p: 
values of 9, at the end point of the 
of ~, so found do not depend on the 
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shape of the curve joining the initial and final points. To do this we consider 
an infinitesimally differing curve joining the two points : 


t, = &%(p) + 8&%(p) (42.19) 


Here 95& is an infinitesimal vector which becomes zero at the initial and final 
points. We denote the values of g, on the varied curve by 9, + 39,. 
The equations for g, + 3¢, will evidently be of the form 


d ele : . 
ae (p, + 89,) = (r:, + ee. Be) + d&)(p, + 89,) (42.20) 
Subtracting (42.18) from this and neglecting higher order infinitesimals, we 
get 

2 ea se 
Ae OPS" FTL EY, + TY,9,55" (42.21) 


Using (42.18) once again and introducing the notation of (42.13) we can put 
this into the form 


a 


d ; 
Jp OP — DRE") = Re a9, £86" + TEEMBe, —Ps,9,86) (42.22) 


For brevity we write my = 89, — P8,9,88* (42.23) 
Then, if (42.14) holds, (42.22) takes the form 


any _ [Te Ethy (42.24) 


Thus the equations for y, have the same appearance as those for 9,; both 
represent equations of parallel transport of a vector along the chosen curve. 
The y,, just as the »,, are uniquely determined by the initial conditions. But 
the initial conditions for the y, are simply that they vanish. For the coordinates 
of the initial point are fixed, so that there 8&* = 0, and the value of ¢, at this 
point is also fixed, giving 59, =0. Hence y, = 0 at the initial point ; with 
.this condition it follows that along the whole curve 
nc) (42.25) 
and therefore Sp, = T2985" (42.26) 


We now consider the final point of the curve. Its coordinates are also fixed, 
giving 5€* = 0, consequently, by (42.26) we must also have 89,=0. This 
means that the function takes on the same value at the final point whether it is 
calculated using the initial curve or the infinitesimally varied one. Deforming 
the curve continuously we shall then also obtain the same result for any two 
curves joining the same two points, not only for curves differing infinitesimally. 
It follows that in any simply connected region the ¢, are single valued point 
functions determined by their values at an arbitrary initial point. 
By virtue of the differential equations (42.16) we have 


do, a 
ves ae (42.27) 


ou, On,, 


because the es are symmetric in their lower indices. It follows that the 
expression 
do = 9, da, (42.28) 
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is a total differential. Integrating and fixing the constant by the value of » 
at the initial point completes the determination of 9. ; 

We have proved that the necessary and sufficient condition for a solution of 
equation (42.08) to exist is the vanishing of (42.13), and that @ 1s determined 
apart from an additive constant by the values at one point of its partial 
derivatives with respect to the coordinates. 

If two, not necessarily different, solutions » and of equations (42.08) are 
given, the expression 

oy = gn (42.29) 
i Ox, OX, 
remains constant as a consequence of these equations. To prove this it is 
sufficient to form the covariant derivative of the scalar (42.29) and to verify 
that it vanishes. (As we saw, (42.16) is just an expression of the fact that the 
covariant derivatives of the ~, vanish.) 

We denote by 2,, x,, x, and x, four solutions of (42.08) which are so chosen 

that at the initial point one has 


, i 
s OL, OX. 2. 


(42.30) 


ay 8 
ox, Ox, 2% 


(no summation over «). Then (42.30) will also be true for all values of the 


coordinates. Hence it follows by purely algebraic reasoning that 
3 


Ox, Ox, 
Jog = > te be (42.31) 


i.e. that g,, can be represented in the form (35.15) and that therefore ds? can 
be brought to the form (35.09). 


Thus the necessary and sufficient condition for the quadratic form 


ds? = 9,, dx, dag (42.32) 
to be reducible to 
ds* = (day) — (dx}) — (das)® — (dx)? (42.33) 
is the vanishing of the expression 
ae, =a, 
i" va. el oa —= ban, + ea ke as ree, (42.13) 


formed out of the Christoffel symbols belonging to the quadratic form (42.32). 


43. The Curvature Tensor 
In the foregoing section we introduced the quantity 


‘ av, ore, 
Bie gee ag Pee = Pea (43.01) 


This quantity pla 
theory of gravita 
detail. We first p 
The most direct 


ys an important role in general tensor analysis and in the 
tion. It is thus necessary to investigate its properties in 
rove that it is a tensor. This can be done in different ways. 
method of proof consists in differentiating the equations 
02z," ax’ j 
o re oO She __8 
Ox, Ox, Oa, Uo) dx, Ox, oe) 
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which state the transformation law for Christoffel symbols as it was given 
in (42.04). The result of differentiating (43.02) with respect to x, can be written as 


Ox, Ox, On, + Wes) ( 


re. az}, are)’ Gx, Ox, 6x, 
= ny I* T BY tT \e io _-“B 
( 2x, oe pv nl ge ex, —( an, + (TG, (Is es) le On, ex, Oa, (43.03) 


ex, 0°, on, 0x4 ax, 0*x, 
Cx ox, Ox, Ox, Cx, OD, Oz, Oxy 0x, 


Here the left-hand side is symmetric in A, u and v and therefore the right-hand 
side must also have this symmetry. Interchanging A and v on the right and 
equating the expression obtained to the original right-hand side, we get an 
equation which, in terms of (43.01), can be written as 

ez’, or Ot, Oty Oar, 


Re vA az, = (Re. me) 


ae (43.04) 
This equation indeed expresses the fact that Re, is a fourth rank tensor 
which is covariant in p, v and A and contravariant in p. This tensor is called 
the curvature tensor. 

With the aid of the curvature tensor one can give an expression for the 
change of a vector in parallel transport around an infinitesimal closed contour. 
Let the value of the vector A, at the starting point 22 be (A,)9. If it is trans- 
ported to an adjacent point its components will take on the values 


A, = (A,)o + (T'2,)0 (Ag)0 (2, — 22) (43.05) 


apart from quantities of second order of smallnesst in the z, — 22. It is thus 
obvious that the changes AA, undergone by the vector components A,, when 
going round an infinitesimal closed contour will be at least of second order 
in the greatest displacements occurring. These changes can be expressed as 
the line integral 

= | Te A. da, (43.06) 


pv*"e 


taken around the contour in question. For an infinitesimal contour the A, 
under the integral may be replaced by (43.05) and the I'¢, by 


ore 


re, = (T*,)9 + (5) (2 — 28) (43.07) 


Ox, 
Inserting (43.05) and (43.07) into (43.06) and using the fact that the integral 


of a total differential around a closed contour vanishes. we get 


ore 
AA, = (= = wy rere.) (Ande | (x, — x°) dz, (43.08) 
The quantities 
1 
OF = Je — 2°) dz, = 5 | [(w, — 2°) dx, — (x, — a8) dx] (43.09) 


+ Such second order quantities will depend on the nature of the curve along which the 
displacement is made, because the vector Ap is not considered as point function. 
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can be interpreted as the projections of the surface enclosed by the contour 
on the coordinate “ planes”. It is easy to verily that they form a contra- 
variant antisymmetric tensor. Owing to this antisymmetry equation (43.08) 


can be written as 
1 
AA, = 5 F. vat @ (43.10) 


Here we have suppressed the suffix 0 which characterizes the initial point. 

By similar arguments the change in the contravariant components of a vector 
can be shown to be given by 

Le =e a Qe (43.11) 

We already know that Re ., is a tensor ; this can also be seen from (43.10) 
and (43.11), for AA, is the difference of two vectors referring to the same 
point and is thus itself a vector. On the other hand, if an arbitrary symmetric 
part is added to the antisymmetric tensor Q™* equation (43.10) does not change. 
Therefore the right-hand side of (43.10) is a vector for any tensor Q’* and any 
vector A,. This is possible only if Ag ,, is a tensor. 

In Section 40 we introduced the operation of covariant differentiation. In 
general, covariant differentiations do not commute, the second covariant 
derivative of a vector or a tensor taken first with respect to z, and then to 
%, is not equal to the second derivative with respect to x, and then %,. We now 
consider the difference of these two derivatives for the case of a vector. Accord- 
ing to (40.05) we have 

V,4, = se — Wied, (43.12) 
B 
We calculate V,V,A, in a locally geodesic coordinate system, which is one in 
which the first derivatives of the y,,, vanish at the point considered. We have 
Ay ONG 
Ox, OLg oz, 


Vi¥e4, = (43.13) 


v 


ary oly 
whence V,V,4, — V,V,4 =(5" — i) 4 
a’ Bt" BT at**y Ox, ox, a] (43.14) 
In the locally geodesic system the factor of A, on the right-hand side coincides 
with the expression for the curvature tensor 
2a aly 


7 ba LB 2 = \ 
wa Be Gn, Talon — Mel Se (43.15) 


Therefore, in a locally geodesic system the equation 
a (V,V, — V5 V,) 4, = RY, 44, (43.16) 
is valid. Now, both sides of this equation are tensors and therefore (43.16) 
must also be true in an arbitrary system of coordinates. The expression (43.16) 
for the difference of second derivatives of a vector is thus of general validity. 
The corresponding equations for a contravariant vector 

. (VV, — VaV,)A¥ = — RY A" (43.17) 

can be derived similarly, 
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The technique used here of first stating an equation between tensors in a 
locally geodesic system and then concluding that it is generally valid is very 
useful for abbreviating the derivation of results. The equations (43.16) and 
(43.17) are so simple that they can also be obtained quite easily without using this 
technique, but there are other cases where the simplification is more substantial. 

Let us, for instance, consider the expression for the difference of second 
covariant derivatives of a tensor of arbitrary rank. 

(4) __ ieaj 
08 = US te (43.18) 
Equation (40.26) for the first derivative can be written in the form 


VU 2 7, Ue jee > yey Hi —1 041 + en 


8B’ () 
= Ue 
> ayy + +Vj—1PVj41+++VkE (43.19) 


Forming the second derivative in a cease system gives = 


(u) __ ye) re H+ +-i—10Vita <n 
vavB Uy = Be, 2a, Ty 2a 
k 
ya Few (43.20) 
++Vj—1PVj+1---Vk 
oe j=1 


Therefore, the difference of second ae in a geodesic system coincides 
with the expression 


(pw) _ Miya ++ e104 +++ Um 
Wg Neer 0c Fey 
) 
a prs pl uy eee Vj—1PYJ41 ++ VE (43.21) 


But since both sides of this equation are een sei) must be correct in all 
coordinate systems. 


44. The Basic Properties of the Curvature Tensor 
Side by side with the mixed curvature tensor 
ar? 2) ie 


Re, «2 = Big _ ax, —H + = Pe alae (44.01) 
we shall consider the covariant tensor 
Ruy, ap” = Iook ra, ad (44. 02) 


According to this definition the mixed tensor R%,, is obtained from K,,, .5 
by raising the second covariant suffix. Therefore, a more precise notation for 
it would be 
Re ag = FE aa = I Rus, of (44.03) 
We note that in older literature the covariant and the mixed tensors used to be 
denoted by the symbols 
Raa = (LV &B) (44.04) 


and Re, a = {u0, aS}, (44.05) 
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which were known as Riemann’s four-index symbols of the first and second 
kind. 

Evaluating the covariant curvature tensor by (44.02) we get 


é 9 29, 
Bos ea = ees») — tov) + Tyee, 4 — 22) — Pe((e0. v) - 2) 
Te ve o 


uw Oats 
(44.06) 
or finally 
1/ eg. 879,14 2°94 8°9 va 
=e va _ 78 ee} Te vB, op] + Mealve, 
Row ap sae 02, +g, on, 2, On, 4, Oy palB, 0] + Palo el 
(44.07) 


Here the last two terms can be expressed in terms of the It, giving 


l/ a9 8°95 29. a9, rr 
ae va oh Se oe o, + eT? 
uy aB slap 8x4 a ax, az, en, az, ax, ex, Joc pa’ vB Jost ap a 

(44.08) 


From this expression the following symmetry properties of the tensor can be 
deduced : 
1. Antisymmetry in its first two suffixes 


Buu a = Pay, ap (44.09) 
2. Antisymmetry in its last two suffixes 
Ray, pa = Puy, of (44.10) 
3. Cyclic symmetry 
BR, ap + Rua, By + Rip, va = 9 (44.11) 


The first two properties are obvious. To verify the last it is sufficient to write 
the left-hand side of (44.11) in a locally geodesic system and to verify that the 
twelve second derivatives that occur cancel in pairs. 

From the three stated properties it follows further that we can interchange 
the first pair of suffixes with the second, without any change of order within 
either pair : 

Ras, w = Rwy, ap (44.12) 


This last property also follows directly from the definition (44.08) ; to show 
that it is not independent, but a consequence of the other three, we can add 
together equation (44.11) and three other equations obtained from (44.11) by 
cyclic permutation of the four suffixes (u, v, «, 8). If the antisymmetry pro- 


perties 1. and 2. are used eight of the twelve terms in the sum cancel in pairs, 
the remaining four giving 


—2Rua vp + Bye ua = 0 (44.13) 
This differs from (44.12) only in the naming of suffixes. 

We now count how many independent components the curvature tensor 
possesses ; we do this on the assumption that each suffix may take on n values 
so that actually n = 4. Evidently all four suffixes cannot be equal. All non- 
vanishing components having only two distinct suffixes reduce to the type 
Ry as Their number is the number of pairs of unequal suffixes, i.e. $n(n — 1) 
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Further, if a triplet of different numbers, «, B, y is given—and there are 
gn(n — 1)(n — 2) such triplets—the non-vanishing tensor components that 
may be formed from it are Ryg os Rogoy i Ryayg Which respectively have the 
first, second or third of the numbers «, 8 and y repeated. The number of such 
components will be three times the number of triplets, ice. 4n(n — 1)\(n — 2). 
Finally, there exist Ayn(n — 1)(n — 2)(n — 3) combinations of four different 
numbers, «, 8, y and 8. With each such combination one can form components 
Rup ye Ryspy and R,, 5, while all other combinations of suffixes reduce to 
these three. However, these three components are not independent because 
they are linked by the cyclic symmetry condition, leaving only two independent 
quantities. Therefore, the number of independent tensor components having 
four distinct suffixes is twice that of quartets of numbers, ie. is equal to 


zyn(n — 1)(n — 2)(n — 3). The total number of independent components is 


5 — 1) + 5nln — Il — 2) + mln — lm — 2m — 8) = nut — I) 


12 
. (44.14) 
In the case of interest n = 4 so that the number is 


6+124+2=20 (44.15) 


One might note that in three-dimensional space (n = 3) the number of indepen- 
dent components of the curvature tensor is 6, which is also the number of 
components of a symmetric second rank tensor. In fact, for m = 3, the curva- 
ture tensor can be expressed in terms of a second rank symmetric tensor, as 
shown in Appendix E. Finally, the curvature tensor of a two-dimensional 
surface (n = 2) has only one component—the Gaussian curvature. 

We note that in Section 31 we have already encountered quantities with the 
same symmetry properties as the covariant curvature tensor and we observed 
there the connection of such quantities with Krutkov’s symmetric tensor. 

We have studied the properties of the covariant curvature tensor. The 
mixed curvature tensor has analogous preperties 


Ripa = — Fi ap (44.16) 
and | a a Re ay + Re n= 0 (44.17) 


corresponding to (44.12) and (44.11). As regards the property corresponding 
to (44.19) it takes on a somewhat more complicated form, namely 


CP a 9 aB o> i aB (44.18) 


(It means that the raising of the first suffix and the lowering of the superfix 
results in a change of sign of the component.) This property can easily be 
derived independently of (44.09) by comparing (43.10) and (43.11) or (43.16) 
and (43.17) or alternatively by use of the equation 


(V.Vp —= VaV)Iuv = Re «69 pv ae Re «BI op = 0 (44.19) 


which follows from the general rule (43.21). 
In addition to the algebraic identities just discussed the curvature tensor 


satisfies some differential relations which can be written as 


V,R a a Vit, aB oF VB, «Bp — 0 (44.20) 


[Ly, a 
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and are called the Bianchi identities. To verify them we introduce a locally 
geodesic coordinate system, in which covariant and ordinary derivatives can 
be identified. The calculations are simplified by the fact that in the expression 
(44.08) for R,,., the terms not containing second derivatives are quadratic in 
the P¢,, so that not only these terms themselves, but also their first derivatives 
become zero in a locally geodesic system. In such a system the left-hand side 


of (44.20) is equal to 
é 0 7) 


Bay, af oe aa, aB aa Ga af 
— ( & va P95 P94 wie 9,5 
1 On, OX, 0%, Ox, Ot, ON, 6x, OX, OX, «ON, Ou, OH, 
aq OO | 1 
4 7 Iva +) —=(...) (44.21) 
Ox, 02, CL, Ou, ON, Ox, 2 


where the dots denote terms of the same structure as those given but with 
a and £ interchanged. Now, the expression in the bracket is itself symmetric 
in x and 8 so that the two brackets are equal and their difference vanishes. 
This proves that (44.20) is valid in a locally geodesic system and thus, being 
a tensor equation, it must be valid generally. 

By contracting with respect to two indices one can construct a second rank 
tensor out of the fourth rank curvature tensor. Contraction with respect to the 
first two, or the last two suffixes evidently gives zero, all other contractions 
give one and the same result, apart from a sign. In this way we get the tensor 


R,,, = g@R Re (44.22) 


Ww, BY 
which is called the second rank curvature tensor or the Riemann tensor. It is 
easy to see that the Riemann tensor is symmetric. For, using (44.09), (44.10) 
and (44.12) we get 


ua, By 


9 Ra, av = gy? Re, wa g Big au g Ry on (44.23) 
or R., = Ri. (44.24) 


Side by side with the covariant Riemann tensor one considers also the mixed 
tensor 


RY = gR.,, (44.25) 
and the contravariant tensor 
Rw gg, (44.26) 
Further contraction, with respect to uw and v leads to the scalar 
R= Ry = gR,, = ggR 
which is called the scalar curvature. 
We now calculate the divergence of the Riemann tensor, 


ya, By (44.27) 


aa = VR => gV,R., (44.28) 
Introducing (44.22) for R,,, we get 


VV Rae (44.29) 
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To the covariant derivative in the above we apply the Bianchi identities written 
in the form 


Vi Rue pv + Velev + Vue ag = 0 (44.30) 
which is obtained from (44.20) with the aid of (44.12). This gives 
Y, = — 99 VaRua, va — 9 OPV Rie, 28 (44.31) 
This relation can be rewritten as 
Y,=—-Y,+VR (44.32) 


for if the summation indices in the first term on the right of (44.31) are renamed 
so as to exchange A with @ and u with « it becomes the same as (44.29). In the 
last term of (44.31) the summation may be performed before the covariant 
differentiation, reducing that term to V,R. Thus we find 


1 OR 
3 Ba, 
in view of the fact that for a scalar the covariant and the ordinary derivative 


coincide. Comparing (44.28) and (44.33) we see that the divergence of the 
tensor 


(44.33) 


v 


1 
=-VR= 
¥,=5VR 


1 
By = Buy — 3 9F (44.34) 


vanishes identically. For this reason the tensor G‘,, is called the conservative 
tensor. As it plays an important part in Einstein’s Theory of Gravitation it is 
also called the Einstein tensor. 

Further transformations of the curvature tensor will be postponed until 
Chapter V, which deals with the theory of gravitation. 


CHAPTER IV 


A FORMULATION OF RELATIVITY THEORY IN 
ARBITRARY COORDINATES 


45. Properties of Space-Time and Choice of Coordinates 


The form of the equations that describe the course of a physical process in 
space and time depends not only on the specific nature of the particular process 
but also on two further circumstances : the properties of space-time and the 
coordinates chosen. The properties of space-time are objective, they are deter- 
mined by Nature and do not depend on our choice. In contrast, the use of one 
system of coordinates or another is to a very high degree a matter of our choice. 
It is true that there is a certain limitation to arbitrariness, in so far as the 
existence of certain preferential coordinate systems, namely the Galilean ones, 
is only assured by reason of the objective properties of space-time ; such co- 
ordinate systems would not exist if these properties were different. However, 
it is always possible, by a mathematical transformation, to pass from a pre- 
ferential coordinate system to any other arbitrary one. We have studied the 
rules for doing this in the preceding chapter. 

To detach oneself from the specific characteristics of any particular process 
one must consider those equations which are of greatest generality and which 
characterize most directly the properties of space-time. The equation that 
states the law of propagation of a wave front travelling with the limiting 
velocity is of this kind. In the first place, this law is obeyed by light, ice. 
electromagnetic waves in free space, but we have already pointed out that it 
should not be regarded as referring specifically to light, but as a general law to 
which any disturbance travelling with limiting velocity is subject. The equa- 
tion for the propagation of a wave front in free space characterizes not only the 
properties of the kind of matter being propagated (e.g. of the electromagnetic 
field) but also the properties of space-time itself. (We have repeatedly pointed 
out that the practical measurement of great distances is based on triangulation 
and on radar, i.e. also on the use of the laws of electromagnetic wave propaga- 
tion.) Consequently, the concepts of geometry and the notion of time are very 


closely connected with the law of wave front propagation in free space. 
In Galilean coordinates 


t=c; ga: t%=Y; =z (45.01) 
this law is expressed by the equation 
(=) (3) (3) Ow \ 2 


where w = const. is the e 
front. Equation (45.02) 
wave surface moves in th 


equation of the moving surface forming the wave 
1s @ mathematical statement of the fact that the 
e direction of its own normal and with the speed. of 
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light. If we identify the normal to a wave surface with a light ray and consider 
the point of intersection of ray and front, equation (45.02) permits us to assert, 
that in Galilean coordinates this front moves along the ray in a straight line 
and uniformly with the speed of light. 

Side by side with the propagation of a wave front we can consider the 
simplest process in which a motion with a speed less than that of light is 
realized. This is the free motion of a mass point. In the form due to Hamilton 
and Jacobi its equations of motion take on a form similar to (45.02), namely 


(F5)" (2)" (F:) (=) i 45.03 
aa) — \eai) —\act) —\aai) = 2) 


where « 1s proportional to the action function. Indeed, equation (45.03) has a 
complete integral of the form 
() = 42, + Gx, + age, — »/(1 + a} + af + a2) - 24 (45.04) 
If the derivatives of w with respect to a,, a), @3 are equated to constants the 
resulting equations describe uniform rectilinear motion with less than light 
velocity. Thus we can take (45.02) and (45.03) to be the equations which reflect 
most directly the properties of space-time. 
The differential expression 


wore (2-8 -AY AY wm 


which enters these equations characterizes both the properties of space-time 
and the physical significance of the coordinates. 

If we did not know the significance of the variables (x), x, x,, 2,) and knew 
only that they were in some way related to space and time (or, mathematically 
expressed, that they served to arithmeticize space and time) we would be able 
to obtain their physical interpretation by considering processes described 
by the equations (45.02) and (45.03). In fact this is how we proceeded in con- 
structing the Theory of Relativity. We established the physical meaning of the 
variables (45.01) in two stages. First we made use of those not completely 
accurate concepts of space coordinates and of time which were accepted in 
classical pre-relativistic physics. The provisional definition of time linked the 
variable ¢ with the motion of a clock or the course of any other periodic process 
and the definition of the space coordinates related the variables x, y and z to 
distances measured with the aid of solid bodies on the basis of Euclidean 
geometry, i.e. to a coordinate mesh. But we stressed from the start that these 
definitions needed to be made more precise. 

Indeed, the use of clocks at different points in space requires a solution of 
the problem of their synchronization and the use of rigid rods for measuring 
large distances is not only impossible in practice but also open to objections of 
principle. 

The definition of physical quantities is never arbitrary ; it always reflects 
Nature with greater or lesser accuracy. Therefore it is possible to make defini- 
tions more precise only on the basis of a deeper knowledge of Nature. The 
particular definitions we are here concerned with, those of the space and time 
coordinates, are supported not only by the law of the uniform rectilinear motion 
of a free body, already invoked by Newton—but also by the firmly established 
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law for the front of a light wave, i.e. by (45.02) and (45.03). These were just the 
lines we followed in developing the Theory of Relativity which is essentially a 
theory of space and time. (The designation “ Relativity ” has historic reasons 
and gives only a one-sided idca of the content of the theory.) The Lorentz 
transformations, so fundamental for the Theory of Relativity as the expressions 
of the nature of space and time and of the precise meaning of the variables 
x, y, 2 and t, are indeed derived from the law of wavefront propagation, together 
with the requirement of conservation for uniform rectilmear motion ; they are 
not given a priort, before this law is established. 

Thus both the properties of space-time and the significance of the Galilean 
coordinates (45.01) are derived from equations (45.02) and (45.03). 

Both the wave front equation and the Hamilton-Jacobi equations for a free 
mass pomt may, however, be written in more general form : 


an a 

Fae nee (45.06) 
ox, on, 
ae 

= pO OO esa (45.07) 
Gu, Ox, 


and the statement just made remains true : both the nature of space-time and 
the significance of the coordinates (ao, £1, %, 23) may be determined from these 
equations. 

Let us assume that the g’ are given functions of their variables. Which 
properties of these functions reflect properties of space-time itself and which 
arc determined merely by the particular choice of coordinates? 

If we admit that the objective properties of space-time are correctly described 
by the usual Theory of Relativity, these properties may be expressed by the 
statement that there exist Galilean coordinate systems in which equations 
(45.06) and (45.07) assume the forms (45.02) and (45.03). 

This assertion means that there exists a substitution 

oT lte Sr, Ba) a8) (a = 0, 1, 2, 3) (45.08) 
which reduces the expression 


Gu do) 
Va)? = quy 
(Vo)? =g &, a (45.09) 
3 ow \2 
to the form (Vo)? = > ex( | (45.10) 
= OL, 


We know from general tensor analysis that a necessary and sufficient con- 
dition for the possibility of reducing the quadratic form (45.09) to one with 
constant coefficients is the vanishing of the fourth rank tensor of curvature 
formed from the g“” and the related 9,,- In order that the transformed quadratic 
form should have one plus sign and three minus signs attached to the squared 
partial derivatives, it is necessary, in addition, that the inequalities stated in 
Section 35 be satisfied. 


Thus, the equations that must be satisfied by the g"” are 
Rig =O (45.11) 


3 
and 9° >0;, > gikkE, <0 (45.12) 
k= 


i,k=1 
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where &,,& and &3 are arbitrary numbers. These, then, are the equations which 
express, independently of any choice of coordinates, the properties of space- 
time in the approximation in which the ordinary Theory of Relativity is valid. 

If these equations are satisfied, the form of the required transformation 
(45.08) is determined uniquely except for a Lorentz transformation. The 
quantities x, = f, must therefore be interpreted as Galilean coordinates, and 
this also gives the interpretation of the variables (xp, 2, 2), £3), in which the 
equations were originally stated. 

In the foregoing discussion we assumed that the g*” are given functions of 
the coordinates, but we can also adopt a different point of view and consider 
them as unknown functions subject only to the conditions (45.11) and (45,12) 
which express the nature of space-time. 

Solving these equations one finds for the covariant components of the metric 
tensor expressions of the form 


(2 SQ (45.13) 
ko ee 

involving four arbitrary functions: f,. (The appearance of arbitrary functions in 
the solution is connected with the fact that the equations for the g,, are covariant 
with respect to arbitrary coordinate transformations.) One or other choice 
of the arbitrary functions f;, can in no way influence the physical consequences 
of the theory, for it corresponds to a purely mathematical transformation of 
the equations from one set of independent variables to another. It is, however, 
appropriate to limit the choice of arbitrary functions in such a way that the 
permitted transformations form as restricted a group as possible and that the 
basic equations appear in the simplest possible form. If such a limitation is 
possible—and this does depend on the objective properties of space-time—the 
preferred coordinate systems obtained in this way will have a more direct 
physical interpretation. In the case here considered a Galilean system is so 
preferred ; it is obtained if in the general solution of (45.13) we put f; = 2x. 
It is, nevertheless, also possible to leave the functions f; indeterminate and 
to formulate the equations for physical processes without presupposing a 
solution of the problem of choosing independent variables. Then, for instance, 
the law of wave front propagation will be expressed by the system of equations 
(45.06), (45.11) and (45.12) for the unknown functions w and g"’. The law of 
motion of a free mass point js expressible similarly. Other examples of such a 

generally covariant formulation follow in the next section. 


46. The Equations of Mathematical Physics in Arbitrary Coordinates 


If for some system of differential equations of mathematical physics the 
tensor form in Galilean coordinates is known, the corresponding equations in 
arbitrary coordinates are obtained simply by replacing ordinary by covariant 
derivatives. This rule is even applicable to equations involving second or 
higher derivatives because, if R,,,. = 0, covariant differentiations applied 
to any vector or tensor are commutative by the general rule (43.21). 

We consider first the-equations of electrodynamics. 


Since the linear differential form (24.04) 
Sp = A, da, (46.01) 
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is an invariant the components of potential occurring in it form a covariant 
vector even for general coordinate transformations. The differential relation 
(24.05) which is imposed on the components of potential can be written as 


VAY = (46.02) 
where the A” are the contravariant components 
AY = gA, (46.03) 
According to (41.08) an explicit form of (46.02) is 
1 7) 
——. — {Vv (-9) A} =0 (46.04) 
The connection between fields and potentials is given by 
F,, = V,4, — VA, (46.05) 


but, as was already noted in Section 41, the terms in which covariant and 
ordinary differentiations differ, cancel when the difference in (46.05) is taken, 
so that for the antisymmetric field tensor we have the expression 


Moa (46.06) 


which is the same as (24.10). The first set of Maxwell-Lorentz equations, 
which in the usual notation has the appearance 


1 oH 
curlE +—-———0; divH = 0 (46.07) 
c at 
can now be written as Fg = 0 (46.08) 


in agreement with (24.17), F,,, being the totally antisymmetrict third rank 
tensor which is expressible in terms of Fy as 


Fava = Vol + ViF ig + VE, (46.09) 
in generalization of (24.12). According to (41.28) this expression is equal to 
oF oF oF 
— uv va ou 
uvo Or, ze 2c, oF oa, (46.10) 


because F.,, is antisymmetric. 


Thus equation (24.12) remains unchanged. We now write down the second 


set of Maxwell-Lorentz equations in arbitrary coordinates. In the usual 
notation this set appears as 


div E = 4m ; eurlH —~—_ = jj (46.11) 
According to (24.20) the left-hand side of (46.11) 


divergence of the field tensor and, according to 
in general coordinates as 


represents the contravariant, 
(41.25), it may be expressed 


V FH = 


1 98 
Va) Oa, 'V (—9)- FH} (46.12) 


T i.e. antisymmetric in all its indices. 
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since the field tensor is antisymmetric. On the right-hand sides of (46.11) 
we have 47/c times the contravariant current vector, i.e. the vector 


Ar 
se =e (46.13) 


where p* is the invariant density and u# the contravariant velocity of the 
charge. Therefore, in general coordinates the equations (46.11) will appear as 


1 a 4dr 
eet —* . fruy = gt —— Kobe 46.14 
and the law of charge conservation follows from them in the form 
1 4) 
V glt = eee — «gt — 0 46.15 
Sey er 2a, (V (—g)-s*} (46.15) 


In order to write the generally covariant equations of motion of a charged 
mass point in an external field it is sufficient to find an expression for the 
vector of acceleration. If zis the proper time, the four-dimensional velocity is 


Pia 46.16 
i = : 
dt ( 
where 9,,uew = c? (46.17) 
The acceleration vector w” will be given by the left-hand side of (38.34) so that 
dx, dx, dx 
Pe Pee 46.18 
a dr oe dt dt ( ) 


This quantity is indeed a vector and in Galilean coordinates it goes over into the 
ordinary expression for acceleration. As for the expression for the Lorentz 
force, we have already stated it in a covariant form in Section 25. Therefore 
the equations of motion will be 
e 
w= ——wFr (46.19) 


ue me a 
where the w, are the covariant components of acceleration given by 
W, = Jy” (46.20) 

In Section 33 we considered the energy tensor of the electromagnetic field 
and represented the density of Lorentz force as its (negative) divergence. The 
corresponding generally covariant equations are easy to state. 

The generalizations of the expressions for the energy tensor of the electro- 
magnetic field are 


1 1 

La To Pal a Taft aes ° (46.21) 

and Uw = — oy aie + | wR Fee (46.22) 
4? 38 16x =e 


Using the Maxwell-Lorentz equations (46.08) and (46.14) we get, after some 
calculation : 


1 
VU" = Ho" Fs" (46.23) 
Te 
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On the right-hand side we have here the negative of the Lorentz force density. 
The generally covariant equations of motion for a continuous distribution of 
charge can be written as in (33.09) in the form 


1 
Kye = — —g??F, s* (46.24) 
U ur a! oa 
where u* is the invariant rest-mass density which satisfies the equation 
1 7) 
V,(utu?) = —— —{V/(—9)-wrug} = 0 (46.25) 
ot /( —9) Ox, 
As for the acceleration w®, according to (46.18), it can be written as 
du?’ 
we = — + FP veut (46.26) 
dt 
ou? 
or uw = ar + Te ueut (46.27) 
The last expression is equal to 
we = ueV uP (46.28) 


where V,u° is a covariant derivative. 
If in analogy to (37.01) we introduce a mass tensor for the charged particles, 
namely 


1 
@r = Gurury (46.29) 


and if we use the law of rest-mass conservation (46.25) we can write 

u*we = c?V 0° (46.30) 
In the equations of motion (46.24) we can express the left-hand side in the form 
(46.30) and we can use (46.23) on the right. Then we get 


V Tee = 0 (46.31) 
where o?Tes = u*urue + Lee (46.32) 
which is c? times the mass tensor of the system of particles and field. 

In a similar way we can write down the generally covariant equation of 
motion of a continuous medium of the type of an ideal fluid. Denoting by 
u* the invariant rest-mass density, including that part of it which changes as 


a result of changes in energy of compression, we get, as in (32.17), for the 
complete energy tensor, i.e. for c? times the mass tensor : 


(2 Teo (us dis Few Spgs (46.33) 


This also satisfies the conservation law (46.31). We note that equation (46.25) 
will now no longer be satisfied but only a similar equation for a quantity o* 
related to u* by (32.25) and representing the invariant density of that part 
of the rest-mass which is conserved. 

In Section 31 we established a general rule according to which the mass 
tensor must be a function of the state of the system and cannot depend 
explicitly on the coordinates. This rule referred to Galilean coordinates, in 
which the g*” have given constant values. We can retain this rule for arbitrary 
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coordinates if we agree to inelude among our functions of state the g*’. (The 
identities (40.14) and (40.16), which express Ricci’s lemma, will formally play 
the part of field equations for the g*”.) 

In all examples here considered, the mass tensor obeys this gencralized rule. 
For instance, in (46.33) the functions of state involved are the velocity eom- 
ponents u®, the pressure p, the invariant density u* and the components of the 
metric tensor g®°. (Not all these functions are independent.) In the energy 
tensor of the electromagnetic field the funetions of state are the field components 


Fg and the g*’. 


47. A Variational Principle for the Maxwell-Lorentz System of 
Equations 

Many equations of mathematical physies can be formulated as conditions 
that a certain integral, called the action integral, take on an extremal value. 
One of the simplest examples of such a formulation is given by the equation 
of a geodesic considered in Section 38. We shall now investigate the rather 
more complicated example of the Maxwell-Lorentz system of equations which 
describe the motion of a continuous charged medium, the elements of which 
interact by means of the electromagnetic field. 

The Maxwell-Lorentz system of equations involves the field components 
F.,, the components uw” of four-dimensional velocity, the invariant rest-mass 


ev? 
density u* and the invariant charge density p*. They have the form 


oF oF oF 
wy 4 SF 4 Sa _ 9 (47.01) 
OX or, on, 
dn. 
V Fey = eye (47.02) 
o* 
and u*w, + —F qu? == 0 (47.03) 
Cc v 


Here w, is the covariant component of the acceleration vector of which the 
eontravariant components arc e 
w* = uvV ur (47.04) 
The invariant densities u.* and o* satisfy the equations 
V (wut) = 0 (47.05) 
and V,(p*ut) == 0 (47.06) 
the second of which is a consequenee of (47.02) 
We must seek to construet an action integral in such a way that variations 
with respeet to the functions involved will give rise to the equations stated. 
We ean simplify the task by the introduction of anxiliary functions chosen 
in such a way as to satisfy some of the equations identically ; the variation 
must then give the remaining cquations. 
With this aim we introduce the potentials and the positions of the particles 
of the medium, and express the remaining functions in terms of these. 
Putting 


CL cr, 


Pr a él, = CAD (17.07) 
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we satisfy (47.01) identically and we shall regard the potentials A, as functions 
to be varied. 

For the description of the motion of the medium we introduce Lagrange 
variables a,, @, @, and ay = p, where a, a, and a, are, for instance, the initial 
coordinates of a particle of the medium and p is a parameter of the nature of 
a time. The functions to be subjected to variation will be 


Gy = fa(Ps A 22, Ga) (47.08) 


For constant a,, @, and a, and varying p they give the motion of a particular 
particle of the fluid. A variation of these functions, i.e. a variation of the 
positions, will be called a displacement and denoted by &: 


2g —s dx, = Sf (P, Q1, Go, a3) (47.09) 


This notation is justified because such variations represent an infinitesimal 
contravariant vector. We shall, in general, think of the &* as functions not of the 
Lagrange variables (p, @,, @,, @3) but of the coordinates (x9, 2,, %, 3) which 
are related to the Lagrange variables by (47.08). 

The components of the four-dimensional velocity can be expressed in terms 
of the Lagrange variables as 


CAL saan 
MEPVCR CAL DICALD) (47.10) 
so that the relation u,ut = 0 a7.) 


is identically satisfied. In (47.10) the g,,, have as their arguments the z, = i 
so that when f, is varied the change in the g,, must be taken into account. 

The introduction of Lagrange variables causes the equation of continuity 
(47.07) to be identically satisfied in terms of the expressions (47.10) for the 
velocity and the expression for the invariant density determined from the 
relation 


w*4/(—9) Lf = Flay, a5, as) Nie oy (47.12) 


Here I is the absolute value of the Jacobian 


D(xo, Ly, Ly, L3) 


I (47.13) 


7 Dp, &), Bo, a3) 
We denote by 8,u* and 8,u* the variations due to the change in the form 
of the functions f,. The quantity u* + 5,u* is the velocity of the varied motion 


at the point x, + &. On the other hand, the velocity of the varied motion 
at the point x, is 


ue + due 4 


out ‘ 
BE + 8 yu (47.14) 


Thus the variation 8u* which describes the change of the velocity field is 


oS ee (47.15) 
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We note that the variation 3,u* is the difference of the values of two vectors 
taken at different points and is, therefore, not a vector. In contrast du* is the 
difference of the values of*two vectors at one and the same point and is, there- 
fore, a vector. Similarly we can introduce variations $,.* and du* for the 
invariant density. They are related by 


au* 
ou* = §u* — i 47.16 
ad it aa ( ) 


For the calculation of 5.* we must find the variations of both sides of (47.12). 
We have 


af, Hf, 1 1 Gres Se af, a 
a(,/ (s “Op z)| = V (Gag 41 OP) careana oe en, Bi,” ‘Op ep + Suc aa 
17 


Inserting 
= = oe (47.18) 
‘Op ax, Op 


and using the expression (47.10) for the velocities, we get 


of, of, | - ( of, “a) ou (; 29 o a 
a (0u58 ze) =f \%p ap) ot NS a2,” 7 9G, 
(47.19) 
Using the relation 
ee 
VES ba, 


ieee (47.20) 
for the covariant derivative we can write the foregoing equation in the form 


a (o.58 #)| J (4.0 ze a2) 2 sate YEP (47.21) 


The variation of the right-hand side of (47.12) will be proportional to. this 
expression. Further, the varied value of the Jacobian J will be 


D(x + E9 gy 4b), toe Eta a EF) 


P= D(p, dy, @y, @s) 
x ee — ps (1 a =I (47.22) 
whenta re a] (47.23) 
We also have ; 
SifV(—9)} = ve (47.24) 
{ev/(—9)-& 


and, therefore, §i{/(-9) 2} = (47.25) 


Ot, 


a 


or SifV(—9) 2} = VE (—9) (47.26) 
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Using (47.21) and (47.26) we can write the variation 8, of the logarithms of 
both sides in the form 


ae = =; u,urV5° (47.27) 

u* 
whence §y* = —p*V,6° + — uu’. ag (47.28) 
and by (47.16) Suk = —V,(u*é) + re uw e2 (47.29) 


In calculating the variation 8, of the expression (47.10) for u* we can use the 
result (47.21). The calculation gives 


ae | ee ee 
dyu* = wu a, — u% wa Ma" VE (47.30) 
oe = du® 1 2 
and, from (47.15), 8u* = u? aS — Fo ytuurV EO (47.31) 
oz, O92, c 


The last expression can be written in the form 
1 
Set Sa VE = OV a utu,u’ V Eo (47.32) 
c 


whence it is evident that $u* is a vector. We also have 
u,du* = 0 (47.33) 
in agreement with (47.11). 
As the charge density p* satisfies the same form of continuity equation 
(47.06) as the mass-density y* its variation will have the same form as (47.29), 
namely 


o* 
dp*¥ = —V,(e*E°) + = 5 Ue wV_E° (47.34) 
Combining (47.32) and (47.34) and using eas we get 
3(p*u%) = V (eure cae o* uk?) (47.35) 
1 o 
dS(o¥ut) = ———_ — a). A®(y5Fa — 9/4 O id 
ne en ogy on i eee (47.36) 


whence it follows that the varied current vector also satisfies an equation of 
continuity, as was to be expected. 

We note that if the displacements —° are proportional to the velocity, so 
that E° = tu° where + is an arbitrary function of the coordinates, the varied 
velocity field and also the varied density do not differ from the non-varied 
ones. This can be verified by a direct calculation ; putting &° = tu° into our 
equations gives du* = 0, du* = 0 and Sp* = 

Having calculated the variations suffered by all quantities as a result of 
displacements &° it is easy to verify that the field equations (47.02) and the 
equations of motion (47.03) eas ss conditions that the integral 


S= | (ew - ie Pwd, tee — Ff, re) /(—g)- (dz) (47.37) 
should have an extreme value. For on we have written 


(dx) = day da, dx, day (47.38) 
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The integral (47.37) is to be taken over some four-dimensional region on the 
boundaries of which the variations &° and 3A, vanish. 
We calculate first the Variation of the last term in (47.37). As is readily 
verified 
8(F oF) = 2F* SP o (47.39) 
Therefore 


rad | Fak? Vi-9)- (de) = & [ F 8Fyy V(-9)- (a) 


1 085A, _ BA, 
~ 8x i (= an, an, 4/(—9): (dx) (47.40) 


Since the tensor F** is antisymmetric the contributions of both terms in the 
bracket are oe therefore we have 


1 a3A 
wag) | Pek Vio a) = — 5 [8S (9) (ee 


2a —g)- F*\-3A (dx) (47.41) 


after an integration by parts. 
The variation of the second term in (47.37) is, by (47.36), equal to 


3 5 [ otu*A, y/(—g): (dx) 
c = 
1 1 a 
ae { pwd, V(—g): (dr) — - | Ay ge LV(—9) 0 (ures — we) da) 


(47.42) 
Performing an integration a parts we obtain from the last integral in (47.42) 


4 | (uote — we (—g) + (de) 
lf oA OA 
== i etree (2 — 2) (9) (ae) 
--: | otek,£¢ \/(—9)- (de) (47.43) 


Thus the second term gives 
a! 
— 28 | ousd,y(—9) (de) = = | ete, + PV (—9) (de) (849 
Finally, the variation of the first term in (47.37) is, by (47.29), 


Bf ctuty(—g) (da) = [ [— et (ute) + wtuuVQE°] V(—9) (a) 
(47.45) 


After integration by parts the term on the right proportional to c? vanishes 
and the area term grat 


3 { oy" (dx) = — f urwV,u,Jee V(—g)-(de) (47.48) 
For, as a result of the pions of continuity (47.05), we have 
wu wVE° = V(utuwe) — w*(wrV ue" (47.47) 


and the integral of the first term is zero. 
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Following (47.04) we introduce the notation 
Wy = UV Uy (47.48) 


for the acceleration and get 
8 [uty (—g)- (dx) = — J wtertev(—9) (de) (47.49) 


Combining equations (47.41), (47.44) and (47.49) we obtain the following 
final expression for the variation of the action integral (47.37) : 


ag =~ [ (uw, + pau) V(—a)- (a) 


4 | (- i ut + = Vy) 84,V/(—9)-(d2) (47.50) 


Here the coefficients of £° and of 8A, are zero by the equations of motion 
(47.03) and the field equations (47.02). Thus we have proved that 
dS = 0 (47.51) 


Conversely, from the condition that the action integral 
- 1 
S=| (c* — f uwtA, + ad —g)- (dz) (47.37) 
c * " 16r 


be an extremum and, using the fact that the variations €° and $A, are arbitrary, 
we can conclude that the equations of motion and the field equations are 
satisfied. 

The circumstance that a variation of the form &° = tu? does not change the 
velocity field, gives rise to the relation 


*K 
w(whn, es ‘. Fat) =0 (47.52) 


which is valid identically, ie. independently of the equations of motion being 
satisfied. This relation shows that of the four equations of motion only three 
are independent. The fact that a variation of the form $A, = 8 d~/dz, does 
not change the electromagnetic field gives rise to the relation 


v(— eu + vy, ree) — 0 47.53) 
{25 ix '* )= eo 


which is also identically fulfilled, i.e. independently of the field equations. 
This relation shows that the four field equations are not independent but are 
related by (47.53). 


48. The Variational Principle and the Energy Tensor 

In Section 45 we pointed out that in the formulation of the Theory of 
Relativity in arbitrary coordinates two points of view are possible. From the 
first point of view the g,,, are considered as given functions of the coordinates 
which are obtained by the usual tensor rule from their Galilean values. From 
the other the g,,, are thought of as unknown functions subject to the equation 


Ruy, ag = 0 (48.01) 
and also to some inequalities which we have stated repeatedly. 
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When calculating the variation of the action integral we adopted the first 
point of view and, accordingly, the form of the functions g,, was not varied. 
We can, however, adopt the second point of view and vary the g,, also. This 
will lead to the appearance of a third term in the expression (47.50) for 3S 
in addition to the two terms corresponding to the variations —° and 84,, a 
term arising from the variation of the g,,. We shall now calculate this term. 

We shall denote the variations of quantities due to the variation of the 
Juv by the symbol 8. In (47.12) neither the Jacobian J nor the function F 
contains the g,,, therefore 


Burv(—o) ale» Aw BG 
wV(—9) Gaal Fl oP FalOp) 2c? 
and. consequently 


ubu’ 89. (48.02) 


* 
8 {n*/(—9)} = eae 
Further, if we multiply (47.10) by (47.12) we obtain an expression that does 
not contain the g,,; the variation of this expression with respect to g,, will be 
zero. Since the charge density p* satisfies the same continuity equation as 
the mass density u*, the variation of the corresponding expression with p* 
replaced by e* is also zero. Thus 


ubu’ 89... (48.03) 


8,{e*u*/(—g)} = 0 (48.04) 
and also 8,{e*utA v/(—9)} = 0 (48.05) 
as the A, are now not varied. 
Further, 
8,(Fagh*) = F,,8, Fr (48.06) 


since the F,,, are not varied, but only the F#*, for they are connected with the 
F,, by the relations 


Fe — gughP (48.07) 

which contain the g,,. The calculation gives 
Fd gh? = 2F FG, 89 = —2F PF ough 8g. (48.08) 

1 1 
Since ———. 8, V(—9) = = 9" 8g, (48.09) 
vV(—-9) ° ZF th 
we get 
1 o4 iv 1 ‘a Vv —— vo 
V/(—9) oF, re V(—9)} = (—2F ug® oe 2 FF i 89.4 = 8x4 89uy 

(48.10) 


where U*” is the expression (46.22) for the electromagnetic energy tensor. 
Equations (48.03), (48.05) and (48.10) result in the following additional term 
in the variation of the action integral 


BS = 5 | (ututur +O) 854, V(—9)- de) (48.11) 


But according to (46.32) the expression in brackets under the integral is c? 
times the mass tensor of the system of particles and field : 


eT — pw*ytuy + UH (48.12) 
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Thus, the expression for the mass tensor appears automatically when the action 
integral is varied with respect to the components of the metric tensor. 
The complete variation of the action integral with respect to all functions 


involved in it is 
3S = ; | (u*utu” + U)89,,, /(—9): (da) 
* 
— f (ura, + = Ratt \Ey(—9) (aa) 
. o* I 
if | (- ae vie), V/(—9) - (dx) (48.13) 
Cc dn 


The variations &° and $4, were completely arbitrary ; equating their coefficients to 
zero gave us the equation of motion and the field equations. The variations 8y,,,,, 
however, are not arbitrary because the functions g,,, are subject to the condition 

Raya = 9 (48.01) 
We shall now express the variations 8g,, in terms of independent arbitrary 
quantities. 

The most general form of functions g,, which satisfy (48.01) can be obtained 
by coordinate transformation starting from the Galilean values of these 
quantities, therefore all possible forms of the g,,, compatible with (48.01) can 
also be obtained one from the other by coordinate transformations. Thus 
the permissible infinitesimal variations of these functions must correspond to 
infinitesimal coordinate transformations. 

Let an infinitesimal coordinate transformation be written in the form 


f 
t= 2p (48.14) 
where 7* is an arbitrary infinitesimal vector whose components are functions 


of %, 2%, 7, and v3. According to the general transformation formula for 


tensors we have : 1 
ox, Ox, 


W(a!) = g(x 
smal) = ge) (48.15) 
and, ignoring higher order infinitesimals, 
P , ony one 
g'* (x’) = gt a ae 
(z') = g(a) +9 an, +9 ae (48.16) 


To obtain the variation, ic. the change of the form of the functions g"’, one 


inust compare g’#” and g” for the same values of the arguments. 
As we have 


cy 


ge) = g'(a) +5 


y9 (48.17) 
(having replaced g’*” by gy” in the correction term), it follows that 


fay vf. ' an ort cy 
ye) — g(a) = By(x) a= gee Sy ye OO 7 (48.18) 
cr, Cx, Cr. ; 
This equation can be written as : 
sgh) = yt0V ny PV nt (48.19) 


or, more briefly, as 


Sgt = Viny fp Vint (48.20) 
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In the case of the Maxwell-Lorentz equation we verified this by direct calcula- 
tion (see equation (48.12) ). 


Let us illustrate the above reasoning with another example, namely, the 


equations of hydrodynamics. In this case the action integral will be of the 
form 


S = | (*e? + o*I)V/(—g) (de) (48.28) 


Here p* is the invariant density of that part of the rest mass which does not 
change in the motion and which satisfies the conservation law 
V,(o*u") = 0 (48.29) 


(see Section 32) and IT is the potential energy per unit mass of elastic com- 
pression of the fluid 


(48.30) 


p being the pressure. 

It is simple to calculate the variation of the integral S, from the expressions 
for the variation of density derived above. Using equations (47.29), (48.03) 
and (48.09), in which we now write o* instead of u* the total variation becomes 


* fy,¥ 
apt = votes) + Tuy, +2 (22 — oma, 48.81) 
SV(-9) _ 1 

— _ gs 48.32 
and, as before, en 5 I 89, ( ) 

Putting 
F(e*) = p*c? + o* Il (48.33) 

we have 
= * ay as F'(o*) So* + Flo* 8 {/(—9)} —g)-(d 
B58 | F (pt) v9): (a) {| (6) 89% + Flot) I ae 


Inserting (48.31) and (48.32) and integrat.ng by parts we get 


BS = 5 [ | @ etry om + eee SE] 80. =a) (de 


+ { for(-“ S) — etn, eev(—a)-(de) (48.38) 


2 2 
ce On, c 


where w, is the acceleration (47.48). Using the relations 
oF’ —-F=p; p*F"do* = dp (48.36) 
which follow from (48.30) and (48.33) we can replace (48.35) by 


as 5 | [— rom + [or + 4 (etl +2)] wth Bau, v9): 


+ | (4S 2 [ert sot talefevi-o-@s) 4837 
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The analogous equation 
39 uy = VN ao Vols (48.21) 
can be obtained either independently by the same method or by use of the 
relation g,,g"” = 9}. 
Rewriting (48.11) as 


2 
3S = > i T#3g ,./(—g)- (dx) (48.22) 
and inserting (48.21) we get 


2 
3,8— 5 f Dn, + Van) V(—9)- (de) = 08 | DH Ving V=9) (ae) 
(48.23) 
since the tensor 7',, is symmetric. After integration by parts we get 


SS = 0? f(VT™\n, V(—g): (da) (48.24) 


If it is assumed that the divergence of the mass tensor vanishes, it can be 
concluded from (48.24) that the action integral also has an extremal value for 
permissible variations of the g,,. However, a reversal of the argument is also 
possible and the equation 


V Te = 0 (48.25) 


can be deduced from the properties of the action integral, T#’ being the tensor 
that appears as the coefficient of 3g,, in the expression for the complete variation 
of S. For the action integral is a scalar, or invariant, and therefore does not 
change for an infinitesimal or any other coordinate transformation. The 
expression for its complete variation 


3S =o? | (V,T™)n,/(—g) (dex) 


— | (ure, + a Pt )y(—9) (aa) 


* 1 
4 | (- " ut bo Vil) B4V/( —g) - (dz) (48.26) 


must necessarily vanish, if the variations €° and 8A, occur as a result of the 
infinitesimal transformation y,. This conclusion follows from the invariance 
property alone and is not dependent on whether S is the action integral or any 
other invariant integral. But, if S is the action integral, the coefficients of 


E° and SA, are separately equal to zero. Therefore, the remaining terms in S 
must also be zero and we have 


8,9 = c? | (V,T")n»/(—g)- (dz) = 0 (48.27) 


for arbitrary y,. Hence (48.25) follows. 

The fact that the tensor T+’, defined as the coefficient of 39, 18 the mass 
tensor or proportional to it, follows from the uniqueness of the mass tensor 
ir reece in Section 31, under the condition that its components are functions 
ot the state of the system ; here the Juy are to be included among such functions. 
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Hence we conclude that the equations of motion have the form 


| Op uu’ ep 
* _ *77 Se 
E + (OMT + »)| pe ak oe (48.38) 
and that the energy tensor is 
1 
ety — Gi + 3 (o* TT + »)| ubuY — pot (48.39) 


We know also that the divergence of this tensor vanishes. 

These expressions are a generalization of (32.29) and (32.28) and reduce to 
them in Galilean coordinates. 

In conclusion, we remark that, mathematically speaking, by no means every 
system of equations can be derived from a variational principle. The fact 
that the basic equations of mathematical physics can be so derived reveals a 
remarkable property possessed by them. 


49. The Integral Form of the Conservation Laws in Arbitrary 
Coordinates 


As we know, the mass tensor 7” satisfies the equation 
Vir“ = 0 (49.01) 


which, for a closed conservative system, must express the conservation laws 
for energy sud momentum and also the conservation law for angular momentum 
and the law of mass centre motion. 

In explicit. form (49.01) can be written as (see (41.24) : 


oT 
Ox, 


v 


Pere + Ps Te = 0 (49.02) 
or, if the first and last terms are combined, 


1 &/(—g): T} 
a Te Ter — 0 49.03 
v/( —9) ox, ¥ a 


Owing to the fact that in (49.03) there is a term remaining outside the sign 
of differentiation the passage from the differential to the integral form of the 
conservation laws is not quite as obvious in general as in Galilean coordinates 
(see Section 31). Therefore, we shall discuss in more detail the question of the 
existence of integral forms of the conservation laws. 

We introduce a vector @, and integrate the equation 


1 O 

LY, = pe —g): TY 49.04 
V(P0,) = ma ge (V9) PH9,) (49.04) 
over a three-dimensional volume on the boundaries of which JT’ vanishes. 
We get 


d 
i V, (T¥9,) V( —9) dx, dx, dry = FER | Te A (9) dx, dx,dz, (49.05) 
0 
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If T+’ is the mass tensor it is symmetric and of vanishing divergence. For such 
a tensor the previous equation can be written as 


ue | T¥0q, \/(—g) dey dry dy 
dz = 


l | TH(V,e, + Vyg,) V(—9) dary dg dey (49.06) 


a) 


= 


The quantity = | TH% W/(—-9)° dz, dr, dz, (49.07) 


will be constant, i.e. will be independent of zg, the coordinate that has the 
character of time, if the vector 9,, satisfies the equations 


Vio, + Vie, = 0 (49.08) 
We first discuss these equations in the general case that the fourth order curva- 
ture term is not necessarily zero. 
We put 
ViPu = Puy (49.09) 
By (49.08) the 9,, form an antisymmetric tensor ; we calculate its covariant 
derivative. By (49.08) we can write 
Vv 


oPyy 


1 1 1 
=5 Ws Se ie 5 (Viv, — Ve). 5 (V,V, — V,Y.)9, 


(49.10) 


Using the relation (43.16) for the difference of covariant derivatives and the 
symmetry properties of the tensor Ff, 4, we get 


VP uv = Fe, Pe (49.11) 
Thus the equation (49.08) can be replaced by the system (49.09) and (49.11) 


which is equivalent to a system of total differential equations. It can be 
shown that this system will be completely integrable in the case that 


Ruy.ap = E (IaIus — Iuadvo) (49.12) 
where A is a constant. A space in which the curvature tensor has the form 
(49.12) is called a space of constant curvature ; it is a four-dimensional generaliza- 
tion of Lobachevsky space. The constant K is called the constant of curvature. 

If the conditions of complete integrability are fulfilled, the values of the 
ten functions , and g,, are determined throughout space by their values at 


one point, so that the general solution of the systern (49.09) and (49.11) involves 
ten constants. 


We are interested in the case 


Ray og = 9 (49.13) 
which is obtained from (49.12) by taking K = 0. (For this reason a space for 
which F.,,,4g = 0 is called a space of zero curvature, or a flat space.) 

In flat space-time, with which we are concerned in the Theory of Relativity, 
we can immediately indicate ten integrals of the system (49.08). Let see 
z, and x, be Galilean coordinates. We can then introduce the four vectors 


f , 


Xo. 


Ox, bx. 
9 = a — —!, Cyc 22 es Gy. 3 
a a ax,’ Gy a) 


+ An exposition of the theory of such systems can be found in Reference [14]. 
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(These quantities are vectors with respect to the unprimed coordinates 29, 2, 
Z_, L3; the Galilean coordinates are to be considered as scalars.) By virtue of 
(42.16) each of the four vectors (49.14) satisfies 


Vio =0; («=0,1,2, 3) (49.15) 


and, therefore, also equation (49.08). 
Further, we can introduce the six vectors 


(2?) — 7’ eh, = Le on, (49.16) 
2 = on, ox, 
each of which satisfies 
Vio ae Vig(2®) =H (49.17) 


which is the same as (49.08). 
Insertion of the vectors (49.14) into (49.07) and multiplication by ¢ gives 
the constant integrals 


Pte i quo ae nl Gy die ae ae, (49.18) 
i 


and insertion of the vectors (49.16) into the same expression gives the further 
constant integrals 

M's? = ¢ i 14( ~ ~~ La a) V( —g) dz, dx, dx3 (49.19) 

u u 

Comparing these expressions with their counterparts in Galilean coordinates 
(equations (31.02), (31.03) and (31.08), (31.09)) it is simple to see their physical 
meaning. The quantity P’° is the energy divided by c, or the mass multiplied 
by c, the P% (¢ =1, 2, 3) are the components of momentum. Further, the 
M” are the integrals of mass centre motion and the Mt the integrals of angular 
momentum. 

The P’* are components of a constant vector and the M’* the components 
of a constant antisymmetric tensor with respect to a Galilean coordinate 
system. But to a constant vector or tensor in a Galilean system there corresponds 
in an arbitrary coordinate system a free vector or tensor, i.e. one all of whose 
covariant derivatives vanish. Therefore, in an arbitrary coordinate system 
we can relate to the integrals of motion the free vector 

3 


ax’ 
a Pp —* 49.20 
” 2, i‘ ax, 
and the free antisymmetric tensor 

3 ’ t 

Ox, Ox, 
eh fa “a "8 49,21 
M,,, 2a ae (49.21) 
These quantities will be functions of the space and time coordinates satisfying 
V.P,=0; V,M,,=0 (49.22) 


The values of these functions at all points are determined by their values at 
any one point. Therefore, the number of constants on which the P, and M,, 
depend will be equal to the number of these quantities, namely, ten. The 
constants of motion P’ and M’@* are just these constants. 
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In the above discussion the role of the Galilean coordinates was purely 
auxiliary, they were simply used as variables in which to express the various 
vectors corresponding to the different integrals of equation (49.08). It is, of 
course, much simpler to conduct the whole argument in terms of Galilean 
coordinates, as was done in Chapter II. However, the very aim of the more 
complicated discussion was to show that even the integral form of the con- 
servation laws can be derived directly from equations written in arbitrary 
coordinates. 

The formulae given in the last two sections of this chapter enable us to formu- 
late the condition for the space-time to be uniform. This can be done in the 
following way. oe 

As stated in the Introduction, space-time is of maximum uniformity if a ten- 
parametric transformation group exists that leaves the functional forin of the 
Ju, invariant. (This means that the g,, transformed according to the tensor rule 
are the same functions of the new variables as the original g,,, were of the old.) 
For an infinitesimal transformation (48.14) belonging to this group we have 
89.» = 0, where 8g,, is given by (48.21). But if in the equations 3g,, = 0 we 
replace the y, by the o, the equations will take on the saine form as that of the 
equations (49.08), which are related to the conservation laws. For these latter 
equations the condition for total integrability is (49.12). Thus this equation 
will also be the condition for integrability of the cquations $9, = 0. 

We see that the necessary and sufficient conditions for space-time to be 
uniform coincide with the conditions for the existence of an integral form of 
the conservation laws for the mass tensor. Therefore the conservation laws for 
those forms of energy which are included in the mass tensor are directly 
connected with the uniformity of space-time. 


49*. Remark on the Relativity Principle and the Covariance of 
Equations 


At the beginning of this book (Section 6) we gave a formulation of the prin- 
ciple of relativity, which together with the postulate that the velocity of light 
has a limiting character, inay be made the basis of relativity theory. We shall 
now investigate in more detail the question of the connection of the physical 
principle of relativity with the requirement that the equations be covariant. 
We have already touched on this problem in the Introduction. 

In the first placc, we shall attempt to give a generally covariant formulation 
of the principle of relativity, without as yet inaking this concept more precise. 
In its most general form the principle of relativity states the equivalence of the 
coordinate systems (or frames of reference) that belong to a certain class and 
are related by transformations of the form | 


©, =f, (Lo, Ty, Ly, 2's) (49*.01) 
which may be stated more briefly as 
a= Fa) (49*.02) 


It is essential to remember that, in addition to the group of permissible transformations, 


the class of coordinate sys shar i i 
vstems must be characterized by certain supplementary conditions. 
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Thus, for instance, if we consider Lorentz transformations, it is self-evident that. these linear 
transformations must connect not any arbitrary coordinates, but only the Galilean coordi- 
nates in two inertial reference frames. To consider linear transformations between any other 
(non-Galilean) coordinates has no sense, because the Cialilean principle of relativity has no 
validity in relation to such artificial linear transformations. On the other hand, if one intro- 
duces any other variables in place of the Galilean coordinates, a Lorentz transformation can 
evidently be expressed in terms of these variables, but then the transformation formulae 
will have a more complicated form. 


Let us now state more precisely what is meant in the formulation of the 
principle of relativity by equivalence of reference frames. Two reference frames 
(x) and (x’) may be called physically equivalent if phenomena proceed in the same 
way im them. This means that if a possible process is described in the coordinates 
(x) by the functions 


Pi(Z), Pol), --- , Pn(Z) (49*.03) 
then there is another possible process which is describable by the same functions 
Pilz’), Pole’), --- 5 Pal’) (49*.04) 


in the coordinates (x’). Conversely any process of the form (49.04) in the second 
system corresponds to a possible process of the form (49*.03) in the first system. 
Such a definition of corresponding processes agrees fully with that given in 
Section 6. Thus a relativity principle is a statement concerning the existence of 
corresponding processes in a set of reference frames of a certain class; the systems 
of this class are then accepted as equivalent. It is clear from this definition 
that both the principle of relativity itself and the equivalence of two reference 
frames are physical concepts, and the statement that the one or the other is 
valid involves a definite physical hypothesis and is not just conventional. In 
addition, it follows that the very notion of a “ principle of relativity ’’ becomes 
well defined only when a definite class of frames of reference has been singled 
out. In the usual theory of relativity this class is that of inertial systems. 

The functions (49*.03) or (49*.04) describing a physical process will be called 
field functions or functions of state. We have already indicated in Section 46 
that in a generally covariant formulation of the equations describing physical 
processes the components g,,, of the metric tensor must be included among the 
functions of state. In the example there discussed we then get the following 
collection of field functions: 


F(*), (2) uv) (49*.05) 


ie. the electromagnetic field, the current vector and the metric tensor. The 
requirement entering the formulation of a principle of relativity that in two 
equivalent reference frames ccrresponding phenomena should proceed in the 
same way applies equally to the metric tensor. Thus if we compare two cor- 
responding phenomena in two physically equivalent reference frames, then for 
the first phenomenon, described in the old coordinates, not only the compo- 
nents of electro-magnetic field and of current density, but also the components 
of the metric tensor must have the same mathematical form as for the second 
phenomenon described in the new coordinates. 

What can be concluded further will depend on whether we assume that the 
metric is fixed or whether we take into consideration phenomena which them- 
selves influence the metric. In the usual theory of relativity described in the 
previous chapters it is assumed that the metric is given once and for all and 
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doés not depend on any physical processes. The generally covariant formulation 
of the theory of relativity given in the present chapter does not change anything 
in this. As long as the assumption remains im force that the character of space- 
time is Galilean and the g,, are introduced only to achieve general covariance, 
these quantities will depend only on the choice of coordinate system, not on the 
nature of the physical process discussed; they are functions ot state only in a 
formal sense. In the theory of gravitation on the other hand, to the description 
of which we tum in the following chapter, a different assumption is made 
concerning the nature of space-time. There the g,,, are functions of state, not 
only in a formal sense, but in fact: they describe a certain physical field, namely 
the field of gravitation. However, when discussing small-scale processes which 
do not influence the motion of heavy masses one can also assume that the 
metric is fixed (though not Galilean). 

To give a definite meaning to the principle of relativity in such circumstances, 
it is essential to specify more closely not only the class of coordinate systems, 
but also the nature of the physical processes for which the principle is being 
formulated. 

We shall first start from the assumption that the metric is fixed (‘rigid ’’), 
or else that it may be considered as fixed for a certain class of physical processes. 
We return to the above definition of corresponding phenomena in two physically 
equivalent coordinate systems, according to which all field functions, including 
the components of the metric tensor, must have the same mathematical form 
for the first. process described in the old coordinates as for the second process 
described in the new coordinates. If the g,, are independent of the nature of 
the physical phenomenon, then in relation to these quantities we need not make 
a distinction between the first and second process, and need consider only 
transformations of the coordinates. But then the quantities 


Iuy(Z) and GIuv(X') (49*.06) 


will be connected by the tensor transformation rule; the requirement of the 
relativity principle that they should have one and the same mathematical form 
reduces (for infinitesimal coordinate transformations) to the equations 8g, = 0 
discussed in Sections 48 and 49. 

We know that the most general class of transformations that satisfies these 


equations contains 10 parameters and is possible only in uniform space-time, 
where the relation 


Fey oa = K-(9.9us — GuxIop) (49*.07) 


(see equation (49.12) ) is valid. If in these relations Kt is zero, the space-time 
is Galilean and the transformations in question are Lorentz transformations, 
except that possibly they may be written down in other (non-Galilean) co- 
ordinates. 

Thus with the rigidity assumption for the metric, the principle of relativity tin- 
phes the uniformity of space-time, and if the additional condition A =: 0 holds, 
we obtain a Galilean metric in appropriate coordinates. The re 
im general form then reduces to the Galilean rela 
condition A = 0, it results in an additional w 
scale of the Galilean coordinates 


lativity principle 
tivity principle. As for the 
niformity of space-time; if the 
is changed, then the scale of the elementary 
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interval changes in the same proportion.{ This property implies in turn that 
there is no absolute scale for space-time, unlike the absolute scale that exists 
for velocities in terms of the velocity of light; the absence of an absolute scale 
for space-time leads conversely to the equation K = 0. 

If we now go over to discuss phenomena which may themselves influence the 
metric, we must reckon with the possibility that under certain conditions the 
principle of relativity will be valid in non-uniform space also. For this to be so, 
it is necessary that the motion of the masses producing the non-uniformity be 
included in the description of the phenomena. 

Indeed, at the end of this book it will-be shown that wnder the assumption 
that space-time 1s uniform at infinity (where it must be Galilean) one can single 
out a class of coordinate systems that are analogous to inertial systems and 
defined up to a Lorentz transformation. In relation to this class of coordinate 
systems a principle of relativity will hold in the same form as in the usual theory 
of relativity, in spite of the fact that at a finite distance from the masses the 
space is non-uniform. Here however one must bear in mind the essential role 
played by the boundary conditions that require uniformity at infinity. Thus 
in the last analysis the relativity principle is here also a result of uniformity. 

Since the greatest possible uniformity is expressed by Lorentz transformations 
there cannot be a more general principle of relativity than that discussed in 
ordinary relativity theory. All the more, there cannot be a general principle of 
relativity, as a physical principle, which would hold with respect to arbitrary 
frames of reference. 

In order to make this fact clear, it is essential to distinguish sharply between 
a physical principle that postulates the existence of eprreaponding phenomena 
m different frames of reference and the simple requirement that equations 
should be covariant in the passage from one frame of reference to another. 
It is clear that a principle of relativity implies a covariance of equations, but 
the converse is not true: covariance of differential equations is possible also 
when no principle of relativity is satisfied. Quite apart from the fact that not 
all laws of nature reduce to differential equations, even fields described by 
differential equations require for their definitions not only these equations, but 
also all kinds of initial, boundary and other conditions. These conditions are 
not covariant. Therefore the preservation of their physical content requires a 
change in their mathematical form and, conversely, preservation of their 
mathematical form implies a change of their physical content. But the realiz- 
ability of a process with a new physical content is an mdependent question 
which cannot be solved a priori. If within a given class of reference systems 
“‘ corresponding ” physical processes are possible, then a principle of relativity 
holds. In the opposite case it does not. It is clear, however, that such a model 
representation of physical processes, and in particular such a model representa- 
tion of the metric, is possible at most for a narrow class of reference systems, and 
certainly cannot be unlimited. This argument shows once again (without in- 
voking the concept of uniformity) that @ general principle of relatuity, as a 
physical principle, holding in relation to arbitrary frames of reference, 1s impossible. 

+ It should be remembered that in the general case the expression for ds?, though always 


a homogeneous function of the coordinate differentials, may also depend in a non- 
homogeneous way on the coordinates themselves. 
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But as a motivation of the requirement of covariance of the equations & 
general principle of relativity is also unnecessary. The covariance requirement 
can be justified independently. It is a self-evident, purely logical requirement 
that in all cases in which the coordinate system is not fixed in advance, equations 
written down in different coordinate systems should be mathematically equiva- 
lent. The class of transformations with respect to which the equations must be 
covariant must correspond to the class of coordinate systems considered. Thus 
if one deals with inertial systems related by Lorentz transformations and if 
Galilean coordinates are used, it is sufficient to require covariance with respect, 
to Lorentz transformations (as was done in Chapters I and II of this book). 
If, however, arbitrary coordinates are employed, it is necessary to demand 
general covariance (Chapter IV). . 

It should be noted that covariance of coordinate systems acquires a definite 
physical meaning if, and only if, a principle of relativity exists for the class of 
reference frames used. Such is the covariance with respect to Lorentz trans- 
formations. This concept was so fruitful in the formulation of physical laws 
because it contains concrete chrono-geometric elements (rectilinearity and uni- 
formity of motion) and also dynamic elements (the concept of inertia in the 
mechanical and the electromagnetic senses; Section 5). Because of this, it is 
related to the physical principle of relativity and itself becomes concrete and 
physical. If, however, in place of the Lorentz transformations one discusses 
arbitrary transformations, one ceases to single out that class of coordinate 
systems relative to which the principle of relativity exists, and by doing this 
one destroys the ccnnection between physics and the concept of covariance. 
There remains a purely logical side to the concept of covariance as a consistency 
requirement on equations written in different coordinate systems. Naturally 
this requirement is necessary, but it is always satisfiable. 

In dealing with classes of reference frames that are more general than that 
relative to which a principle of relativity holds, the necessity arises of replacing 
the explicit formulation of the principle by some other statement. The explicit 
formulation consists in indicating a class of physically equivalent frames of 
reference; the new formulation must express those properties of space and time 
by virtue of which the principle of relativity is possible. With the assumption 
of a rigid metrie this is achieved by introducing an additional equation (49*.07). 
We saw in this chapter that with the additional assumption of the absence of 
@ universal scale (A = 0) these equations lead to a generally covariant formula- 
tion of the theory of relativity, without any alteration of its physical content. 
The Galileo—Lorentz principle of relativity is then maintained to its full extent. 

The very possibility of formulating the ordinary theory of relativity in a 
generally covariant form shows particularly clearly the difference between the 
principle of relativity as a physical principle and the covariance of the equations 
as a logical requirement. In addition, such a formulation opens the way to 
generalizations based on a relaxation of the assumption of a rigid metric. This 
relaxation provides the possibility of replacing the supplementary conditions 
(49*.07) by others which reflect better the properties of space and time. This 


leads us to Eimstein’s theory of gravitation, which will be discussed in the 
following chapters. 


CHAPTER V 


THE PRINCIPLES OF THE THEORY OF GRAVITATION 


50. The Generalization of Galileo’s Law 


The most essential characteristic of the gravitational field by which it differs 
from all other fields known to physics reveals itself in the effect of the field on 
the motion of a freely moving body or mass point. In a gravitational field all 
otherwise free bodies move in the same manner, provided the initial conditions 
of their motion, i.e. their initial positions and velocities, are the same. This 
fundamental law may be thought of as a generalization of Galileo’s law that 
in the absence of resistance all bodies fall equally fast. 

It is appropriate to recall at this point the definitions of inertial mass and of 
gravitational mass. Inertial mass is the measure of the ability of a body to 
resist acceleration ; for a given force the acceleration is inversely proportional 
to the inertial mass. Gravitational mass is the measure of the ability of a body 
to produce a gravitational field and to suffer the action of such a field ; in a 
given field the force experienced by a body is proportional to the gravitational 
mass. 

Using these definitions the aforementioned generalization of Galileo’s law 
can be formulated as a statement that the inertial and the gravitational masses 
of any body are equal. 

According to Newton the gravitational field can be characterized by the 
gravitational potential U(z, y, z). The gravitational potential produced by an 
solated spherically symmetric mass M at points exterior to itself is 


_ YM 


- (50.01) 


where r is the distance from the centre of the mass. The quantity y is the New- 
tonian constant of gravitation—in c.g.s. units it has the value 


= 1 cm? 
~~ 15 000 000 g.sec? 


Thus U has the dimensions of the square of a velocity. We note immediately 
that in all cases encountered in Nature, even on the surface of the Sun or of 
super-dense stars, the quantity U is very small compared to the square of the 
speed of light 


Y (50.02) 


; U <c? (50.03) 
In the general case of an arbitrary mass distribution the Newtonian potential 
U it produces satisfies Poisson’s equation 


AU = —4ryp (50.04) 


where p is the mass density. The Newtonian potential is fully determined by 
-Poisson’s equation together with continuity and boundary conditions which 
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are as follows: the function U and its first derivatives must be finite, single- 
valued and continuous throughout space and must tend to zero at infinity. 

Let us assume that the Newtonian potential U is given. The force experienced 
by a body (mass point) of gravitational mass (m),, in a gravitational field of 
potential U is 


F = (m)_ grad U (50.05) 
On the other hand, by Newton’s laws of motion, we have 
(m),w =F (50.06) 
Therefore 
(m),,W = (m),,.grad U (50.07) 


By the generalization of Galileo’s law the motion of the body in a given gravita- 
tional field cannot depend on its mass. Therefore, the ratio of inertial mass 
(m),, to gravitational mass (m),, must be the same for all bodies ; it is thus a 
universal constant whose value can only depend on the choice of units for the 
two masses. In the units quite generally accepted one has simply 

(M)in = (Mgr = (50.08) 
so that inertial and gravitational masses are equal. 

The equality of inertial and gravitational mass is such a familiar fact that it 
is usually accepted as something obvious. However, the matter is not so 
simple : their equality is a separate and very important law of Nature, closely 
connected with the generalization of Galileo’s law. 

As a result of the equality of inertial and gravitational mass the equation 
of motion 

w = grad U (50.09) 
has universal character, and thus formally expresses the generalization of 
Galileo’s law. 

We note that the equations of motion (50.09) can be obtained from the 


variational principle 


8 J (Av? + U)dt = 0 (50.10) 


This fact will be a guide to us in constructing the theory of gravitation. 


51. The Square of the Interval in Newtonian Approximation 
The phenomenon of universal gravitation forces us to widen the framework 
of the theory of space and time which was the subject of the foregoing chapters. 
The necessity of this widening becomes clear from the following considerations. 
It follows from the equation of wave front propagation, which can be stated 


in the form 

L (S)’ du 2 Ow’ 2 Ow) 2 

(3) () y=) em 
that hight is propagated in straight lines. But light possesses energy and by the 
law of proportionality of mass and energy all energy is indissolubly connected 


with mass. Therefore light must possess mass. On the other hand, by the 
law of universal gravitation, any mass located in a gravitational field must 
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experience the action of that field and in general its motion will therefore 
not be rectilinear.| Hence it follows that in a gravitational field the law of 
wave front propagation must have a form somewhat different from the one 
given above. But the equation of wave front propagation is a basic character- 
istic of the properties of space and time. Hence it follows that the presence of 
the gravitational field must affect the properties of space and time and their 
metric is then not a rigid one. This is indeed the conclusion reached in the 
theory of gravitation which we now begin to construct. 

As was shown in Chapter I, the equation of wave front propagation (51.01), 
with some additional assumptions, leads to the following expression for the 
square of the interval : 

ds? = c® dt? — (dx? + dy? + dz?) (51.02) 
The influence of the gravitational field on the properties of space and time 
must have the consequence that the coefficients in the equation of wave front 
propagation and in the expression for the square of the interval will differ 
from the constant values appearing in (51.01) and (51.02). We must now find 
an approximate form for the square of the interval in a gravitational field of 
Newtonian potential U, relying on the generalization of Galileo’s law to guide 
us. The fundamental fact that the law of motion for a body moving freely 
in a gravitational field is a universal one which does not depend on the nature 
of the body permits us to find the relation between the law of motion and 
the metric of space-time. 

The equations of a geodesic in a space-time with given metric were studied 
in Section 38. We shall now try to find a metric such that these equations 
coincide approximately with the Newtonian equations of motion for a free 
body in a given gravitational field. If this attempt is successful it will enable 
us to introduce the hypothesis that in a space-time with given metric a free 
body (mass point) moves along a geodesic ; in this way the connection between 
the law of motion and the metric will be established. 

As we know, the equation of a geodesic may be derived from the variational 
principle 


FY | ds =0 (51.08) 
If the squared interval is of the form (51.02) we have 
ds = +/(c? — v?)dt (51.04) 
2 
or, for small velocities, ds = (< — 5) (51.05) 


Inserting (51.04) or (51.05) into (51.03) gives equations that describe motion 
with constant velocity, which indeed is free motion in the absence of a gravita- 
tional field. We can now assume that for small velocities and weak gravitational 
fields (U < c?) the expression for the interval takes the form 


ds = /(c? — 2U — v2)dt (51.06) 
or ds = f — : (Av? + v)| dt (51.07) 


+ The theory of the deflection of light in a gravitational field is given in Section 59 below. 
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in place of (51.04) or (51.05). Since neither an additive constant nor a constant 
multiplier are of any importance in a Lagrangian the variational principle 
(51.03), with ds taken from (51.07), gives the same result, as the variational 
principle 


3 | (v2 + U)dt = 0 (51.08) 


which was formulated at the end of Section 50, but this did indeed describe 
free motion of a body in a gravitational field. _ 

It is true that just because additive constants and multiplicative factors in @ 
Lagrangian are immaterial equation (51.08) could be obtained from (51.03) and 
(51.06) with any sufficiently large value of the constant c; but we must require 
that in the absence of gravitation, when U = 0, the expression (51.06) for the 
interval shall go over into the Galilean form (51.04) whatever the value of v?. 
This requirement fixes the constant c in (51.06) to be equal to the speed of light. 

These arguments give us good reason to assume that under the conditions 


U<e? 
dz\2 (dy\? (dz\?_ ° (51.09) 
(a) * (a) + (a) =" <: 
the square of the interval differs little from the form 
ds®* = (c* — 2U)dt® — (dx? + dy? + dz?) (51.10) 


Here the relative error in the coefficient of dt? will certainly be of higher order 
than the term 2U/c? which is included. As regards the coefficient of the purely 
spatial part of the interval, it may differ from unity by a quantity of the 
same order as 2U/c?. Indeed, the theory of gravitation to be developed in the 
following sections gives the more exact expression 


ds? == (c2 — 2U) dt? — (2 as =) ae" + dy? + dz?) (51.11) 


Under the conditions (51.09) the difference between (51.10) and (51.11) is 
negligible, as it should be. 

In principle the value found for the coefficient of dt? is capable of experimental 
verification. 

Let us assume that at some point (,, y,, 2,) at which the gravitational potential 
is U,, there is some emitter of radiation of proper period 7'y. The wave radiated 
by it will have a time dependence of the form 


exp (2n 7 51.12 
ap (22 7] (51.12) 


where 7, is not equal to 7’, but is related to it in the same way as dt is related 
to dr, the differential of the proper time of the emitter. If, for simplicity, the 


emitter is assumed to be at rest in the frame of reference chosen, we have 
approximately 


1 


and therefore T= (2 — ar (51.14) 
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In this problem the dependence of gravitational potential on time may be 
ignored, so that the gravitational field can be treated as static. Then the 
wave being propagated from the emitter will retain its time dependence (51.12) 
throughout space. 

Let us now assume further that at some other point (x,, y,, 2), where the 
gravitational potential has a different value U,, there is a second identical 
emitter, e.g. another atom of the same element. The wave emitted by it will 
have a time dependence throughout space of the form 


exp (i2n 7) (51.15) 
Us 
where f= (1 — a)t: (51.16) 


Thus the two waves emitted by identical sources but originating from places of 
different gravitational potential have periods differing by 
T= T= ae Ty (51.17) 
If U, is the potential on the Sun and U, the potential on the Earth we have 
U, > U, and the numerical value of the factor of 7, in (51.17) is approximately 
equal to 

of ace = 2 x 10-* (51.18) 

Cc ‘ 

Thus the wave lengths of spectral lines originating on the Sun must be dis- 
placed relative to the corresponding lines produced on the Earth by-two parts 
in a millon towards the red end of the spectrum. 

However, one must note that the emission of spectral lines on the Sun takes 
place in physical conditions different from those on Earth and that the change 
of period due to the difference of gravitational potentials is to a great extent 
masked by other corrections. 

There are, however, certain super-dense stars, such as the companion of 
Sirius, which have a density tens of thousands of times greater than that of 
water. On their surfaces the value of gravitational potential is significantly 
greater than on the surface of the Sun—twenty times greater in the case of the 
companion of Sirius—for such stars the correction due to the difference in 
gravitational potential becomes very appreciable and can be detected experi- 
mentally. 

Recently (1959) the influence of the gravitational potential on the frequency 
of emitted light was successfully revealed in terrestrial conditions by making 
use of the extremely sharp lines corresponding to nuclear levels. These sharp 
lines are-obtained in the emission of gamima-quanta by atoms in crystals in those 
emission processes which are not accompanied by emission or absorption of 
phonons. In such radiation the recoil momentum is transferred to the whole 
crystal, as a result of which the energy transfer to the crystal proves to be 
negligibly small and the emitted gamma-quantum. possesses practically the same 
frequency as it would have had without recoil. This phenomenon was dis- 
covered by Méssbauer in 1958|46]. We shall not discuss the relevant theory 
here, but will mention that the crystal temperature has a decisive influence on 
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the probability of ‘* phononless ” radiation, but not on its frequency or line 
breadth. Conversely, in radiation accompanied by phonons (which also occurs) 
a considerable line broadening takes place as a result of the Doppler effect, 
connected with the thermal motion of the atoms in the crystal. 

The frequency of the gamma-radiation in the phononless process is in reso- 
nance with the absorption frequéncy of the same line in the absorber crystal, 
and this resonance is so sharp that a relative change of one of the frequencies 
by a quantity of order 10~** is sufficient to destroy the resonance. With such’ 
a sensitivity the change in the potential of terrestrial gravitation in a rise of, 
say, 10m above the surface of the Earth becomes -detectable; for, putting 
g = 9.8 m/sec? and h = 10 m we get 

a ier La 11 x 107% (51.19) 

c c 

Thus, if the absorber atom is above the emitting atom its resonance frequency 
will be greater than the frequency of the gamma-quantum reaching it from below 
(as it rises, the gamma-quantum, so to speak, loses energy); at a height of 10m 
the relative frequency difference is given by (51.19). This difference can be com- 
pensated, and therefore measured, by using the Doppler effect produced by 
moving the crystal as a whole. Here one has to use the transverse (quadratic) 
Doppler effect, because the longitudinal (linear) effect: would be too coarse a 
means of compensating the frequency shift in such precision experiments. 

The theory of the Doppler effect is given in Section 13; the formula for the 
transverse effect is found from (13.16) for 7, = 0. As is clear from the dis- 
cussion of Section 13, the frequency shift in the transverse Doppler effect occurs 
as a result of the relation between proper time z and coordinate time ¢t, while 
there is no correction due to the need to express the instant ¢ of emission in 
terms of the instant (* of signal reception, provided the radial velocity is zero. 
Therefore equation (15.07) may be used directly to derive the condition that the 
gravitational effect be compensated by the transverse Doppler effect. This 
formula can be written as a relation between the frequencies; we put vo for the 
proper frequency of the quantum and v for the local frequency that corresponds 
to coordinate time. We then get 


: 1 a hear U : 51.20 
_= = a fe4 pe f 
Vo ails a )| v ( ) 
If the lower point is the position of the stationary crystal and has the potential 
U», while the potential at the upper point is (’, and the crystal there is moving 
with the transverse velocity +, then the atoms in both crystals will be in 
Tesonance (v. = v,) if the condition . 

30° ++ U, = 0, (51.21) 


is satisfied. Since we have U, — U, = gh, this gives 


v2 = 29h (51.22) 


We see that the experimentally established presence of a frequency shift in 
the gravitational field and the possibility of compensating it by a transverse 


Principles of the Theory of Gravitation 189 


Doppler effect :provide a direct verification of equation (51.07), and by the 
same token, of equation (51.10) for the squared interval. 

Thus, one can consider that the value we have found for the coefficient dt? 
in the metric form is in agreement with experimental findings. 


52. Einstein’s Gravitational Equations 

Einstein’s theory of gravitation in its restricted, non-cosmological, form has 
the following basic idea. 

The geometzical properties of real physical space and time correspond not. 
to Euclidean but to Riemannian geometry. In Chapter III we discussed the 
basic postulates of this geometry. Any deviation of geometrical properties 
from their Euclidean, or to be precise, pseudo-Euclidean form appears in 
Nature as a gravitational field. The geometrical properties are indissolubly 
linked with the distribution and motion of ponderable matter. This relation- 
ship is mutual. On the one hand the deviations of geometrical properties from 
the Euclidean are determined by the presence of gravitating masses, on the 
other, the motion of masses in the gravitational field is determined by these 
deviations. In short, masses determine the geometrical properties of space 
and time, and these properties determine the movement of the masses. 

We shall now attempt to formulate these ideas mathematically. 

In the previous section we saw that in a certain coordinate system, which 
for practical purposes coincides with an inertial frame of Newtonian mechanics, 
the Newtonian potential of gravitation U enters the coefficient of dt? in the 
expression for the square of the interval, i.e. the coefficient go. of the general 
expression 

ds? = g,, dx, dx, (52.01) 


On the other hand, in Newtonian approximation the gravitational potential 
U satisfies Poisson’s equation 
AU = — Arye (52.02) 


The required generalization of Newton’s theory of gravitation must be covariant 
with respect to arbitrary coordinate transformations. Therefore it is impossible 
to regard as a generalization of the Newtonian potential a term in the coefficient 
Joo oT this whole coefficient ; instead the whole set of coefficients g,, must be 
taken into consideration and must appear as the generalization of Newton’s 
potential. The fundamental metric tensor must satisfy a set of equations that 
are generally covariant and in the Newtonian approximation one of them must 
go over into Poisson’s equation for the potential U. The total number of 
equations must, generally speaking, be equal to the number of unknown 
functions, i.e. to the number of components of the tensor g,,, which is ten. 

On the left-hand side of Poisson’s equation there is a second order differential 
operator, the Laplace operator, acting on U. Therefore, the simplest generally 
covariant generalization of this left-hand side will be a tensor which involves 
linearly the second derivatives of the metric tensor g,,,,. 

Such tensors are the curvature tensors (either of second or fourth rank). 
The fourth rank curvature tensor RK, ,, 1s unsuitable because its components 


Hy,a 
do not contain expressions which could be generalizations of the Laplace operator 
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acting on U. Also, it has too many components, the number being twenty, 
twice as many as there are unknown functions.t Therefore, there remains the 
second rank curvature tensor which has just the right number of components. 
On the right-hand side of Poisson’s equation the mass density p appears. 
A generalization of the mass density which has the required tensor character 
i3 the mass tensor T¥” whose invariant is equal.to the invariant mass density. 
We are thus led to the conclusion that the required generalization of Poisson’s 
equation must be a relation between the second rank curvature tensor Re 
and the mass tensor 7’. 
In the. previous chapters we saw that in the absence of gravitational fields 
the divergence of the tensor 7” must vanish 


Vr =0 (52.03) 


We shall retain this equation for the general case, postponing the discussion 
of questions connected with it (questions of the energy of a gravitational field, 
of the integral form of conservation laws, etc.) until Chapter VII. 

But we established at the end of Chapter III that the tensor 


Guy = Re — 1g’R (52.04) 


which is known as Einstein’s tensor, or the conservative tensor, has the remark- 
able property that its divergence is identically zero. : 


V, Gey = 0 (52.05) 
Therefore, if we put 


RY —ig’R = —xT” (52.06) 


where x is a constant, equations (52.03) for the mass tensor will be a conse- 
quence of (52.06). 

As we know, the metric tensor g#” itself also satisfies (52.05), therefore 
we could add to the conservative tensor on the left-hand side of (52.06) a tensor 
of the form 4g” where ) is a constant, without violating (52.03). 

We saw in Section 31* that if one imposes purely local conditions on the mass 
tensor (the condition that it should depend on the field components or other 
functions of state, but not: on the coordinates; the vanishing of its divergence 
by virtue of the field equations) it is only determined apart from two constants. 


To be more precise, if the tensor J“ satisfies the conditions stated, these are 
also satisfied by 


T'™ = aT + Bg (52.07) 


Here the constant « depends on the choice of energy unit, the constant @ also 
on the conditions at infinity. If in supplementation of the local conditions one 
demands that at infinity, where the field vanishes, the mass tensor should also 
be zero, then 8 will be zero and the mass tensor is determined uniquely. 


+ It is true that even an excessive number of equations for the tensor Ruyop Might prove 
to be compatible as is the case for a space of constant curvature (equation (49.12) ) but in 
that case the equations permit the metric to be determined purely locally, i.e. without using 


boundary conditions, They therefore have a character different from that of Poisson’s 
equation for which boundary conditions are essential. 
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These results were established for the case of a Euclidean metric, but one may 
expect that they will be true also for the general case, which differs from that 
discussed in Section 31* in two respects. In the first place, the ordinary de- 
Tivatives with respect to the coordinates in the field equations will be replaced 
by tensor derivatives. In the second place, the quantities g*’ will be added 
to the functions of state (with the statements that their tensor derivatives 
vanish taken, formally, to be their field equations). However neither of these 
facts can essentially change our conclusion on the degree of uniqueness of the 
mass tensor determined from local conditions. The possibility of introducing 
two constants « and 6 is directly evident in the general case too; one may thus 
expect that our conclusion on the uniqueness of the mass tensor also remains 
valid (of course, it would be desirable to have a formal proof).t 

Replacement of the tensor 7” by the linear function (52.07) of itself corre- 
sponds to replacing the gravitational equations (52.06) by 


RY —igR = —xT — dg (52.08) 


The constant A is called the cosmological constant. It is clear from these re- 
marks that the question of the value of \ acquires a definite meaning only after 
the conditions are formulated by which the tensor T“” is to be defined uniquely. 
Such conditions will necessarily be of non-local character; they can therefore 
only be formulated starting froin definite assumptions about the character of 
space-time as a whole. 

At the beginning of this section we stated the basic assumptions of Einstein’s 
theory of gravitation in its limited (non-cosmological) formulation. According 
to these the concept of spatial infinity retains its meaning, space-time at infinity 
being pseudo-Kuclidean (Galilean). Deviations from Euclidean character are 
observed only at a finite distance from massive bodies. But in this case the 
mass tensor may contmue to be subjected to the requirement stated for the 
case when the whole of space-time was, assumed pseudo-Euclidean. We can 
demand that at infinity, where the field vanishes, the mass tensor should also 
become zero. It then has a uniquely defined meaning to inquire after the value 
of the cosmological constant and the answer can be based on the following con- 
sideration. According to-our basic postulates, the absence of a gravitational field 
signifies the absence of deviations of the geometry of space-time from the 
Euclidean, and therefore also the vanishing of the curvature tensor R*” and of 
its invariant R. On the other hand, the gravitational field will be absent if the 
mass tensor 7"’ is zero everywhere. Therefore the equations T¥’ =- 0 and 
R*’ — 0 must certainly be compatible and this is only possible if the equations 
relating GY’ to T*’ do not contain the term Ag"” (i.e. if A = 0). 

Thus, given the assumptions formulated at the beginning of this section and 
given our definition of the mass tensor, the appropriate generalization of 
Poisson’s equation for the potential will just be equations (52.06). 

As for equations (52.08), they should be used if the problem is stated cos- 
mologically, in which case the concept of spatial infinity is inapplicable and the 
tensor 7’ contains an unknown constant 6, even after units have been fixed. 
According to the value of this constant the value of the so-called cosmological 


+ A proof of uniqueness for the electrodynamic and the electromagnetic cases is given in 
the papers by C. Jankiewicz [47]. 
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constant A must be selected; it is evidently related to 8. The choice of some 
particular value of 4 for a given normalization of T*Y represents a special hypo- 
thesis, which must be introduced explicitly; this is true also for the value 
A= 0. 

Returning to the non-cosmological case and to equations (52.05) we may 
assert that under the conditions stated (correspondence with Poisson’s equation, 
general covariance, linearity in the second derivatives of the g*’, identical 
vanishing of the left-hand side of (52.05) and Euclidean character in the absence 
of masses) these equations are walque. 

The equations (52.06) are called Einstein’s gravitational equations ; they play 
a fundamental part in the theory of gravitation. They will be examined in the 
following sections. 


53. The Characteristics of Einstein’s Equations. The Speed of Propa- 
gation of Gravitation 


We begin our discussion of Einstein’s gravitational equations 
Re — }g"R = — xT (53.01) 


with the derivation of the first order equation for their characteristics. From a 
physical point of view the equation of the characteristics represents the propa- 
gation law for the wave front of a gravitational wave. 

Multiplying (53.01) by g,,, and summing we obtain the relation 


R=xT (53.02) 


connecting the invariants of the curvature tensor and of the mass tensor. 
This relation enables us to write the gravitational equations in the form 


RY = — x(Te — lige) (53.03) 
In Appendix B the contravariant curvature tensor R*”is shown to be expressible as 


Re 1,28 0"g — 
0 aa ae, FT (53.04) 
where I“.*° is the quantity obtained from I, by raising suffixes 
TPe.c8 = groghop (53.05) 


Therefore, the last term in (53.04) does not involve second derivatives but is a 


homogeneous quadratic form in the Py, and hence also in the first derivatives 
of the metric tensor. 


Second derivatives appear in the first term and also in the I™’, but the 
latter dependence is only through first derivatives of the quantities 


IY = g*Iv, (53.06) 


which were introduced in Section 41. We recall that the d’ ' 
function may be written in the ee at the d’Alembertian of any 
any ay 
im = 6 —_—— —_]v 
» ® On Ox r 


« OL, On, 


(53.07) 
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or, alternatively, as 


Pee Mee ee ec 

Oy ~ a/(— 4) an, \v 9) g 8 Oz, (53.08) 

whence 
| a 
[eS ee — p 

Woah a, (Vi 9) (53.09) 
and also 

Il = —(h, (53.10) 


The [#’ are obtained from the I’ by a rule which is formally identical with the 
rule for forming the symmetrical contravariant derivative of a vector: 


re = avery + wry (53.11) 
or in detail 
1/_ ar are age 
yey — _| gee VO a . 
5(s ae a ear pe) 


Of course, since I’ is not a vector the I’ are not a tensor. This circumstance 
proves very useful in simplifying Einstein’s equations. 

Einstein’s equations are generally covanant and therefore permit coordinate 
transformations involving four arbitrary functions. Suppose the equations 
are solved in some arbitrary system of coordinates. We can then go over to 
other coordinates by taking as independent variables four solutions of the 
equation [}) = 0. These solutions may be chosen in such a way as to satisfy 


the inequalities to which the g"” must be subject, according to Section 35, and 
they may also be subjected to some additional conditions. But as long as each 
of the coordinates x, 21, Z, and 2, satisfies the equation (Jz, = 0 we shall 
have in that system 

I*=0 (53.13) 
and therefore also 

rm =0 (53.14) 


We shall call such a coordinate system harmonic. At the moment we are not 
interested in the question of the uniqueness of the harmonic coordinate system 
or rather in the additional conditions which could guarantee uniqueness. This 
question will be considered in Section 93. Here it is only important to note that 
the equations (53.13) are compatible with Hinstein’s equations and that they 
do not impose any essential limitation on the solutions of the latter, serving 
only to narrow down the class of permissible coordinate systems. 
Under the conditions (53.13) the expression for the R#Y simplifies, becoming 
Mem. : a*g’ Be 
RY = 40" 5 os, hese (53.15) 
Hence second derivatives only appear combined in the d’Alembert operator 
acting on the single quantity g” which has the same indices as the particular 
R*’ on the left-hand side. 
The form of the equation of the characteristics for any given system of 
equations depends only on the terms containing the highest occurring order 


¢ The conditions '* — 0 were first introduced by de Donder [16] and by Lancaos [17]. 
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of derivatives. In the case of the system (53.01) and (53.13) these terms are 
just those involving the d’Alembertian. 

Therefore the system of equations of gravitation will have the same characteris- 
tics as d’Alembert’s equation, 


Oy =0 (53.16) 
and these are easily found. As shown in Appendix C they have the form 
given ees) (53.17) 
6x, Ox, 
where W(X, £1, Ly, Lz) = const. (53.18) 


is the equation of a wave front, i.e. the equation of a moving surface on which 
any discontinuities of the gravitational field must lie. 

The equation (53.17) for the propagation of a gravitational wave-front is the 
same as the corresponding equation for the front of a light wave in empty 
space on which the whole theory of space and time was based.t Briefly one 
can say that gravitation is propagated with the speed of light. 

That in Einstein’s theory gravitation is propagated with the speed of light 
is a fact of fundamental significance. It shows that the assumed form of the 
gravitational equations is in agreement with the general postulate of the 
Theory of Relativity according to which there exists a limiting velocity for 
the propagation of all types of action, namely the velocity of light in free space. 
The existence of a finite propagation velocity for gravity removes the contra- 
diction inherent in Newton’s theory of gravitation with its admission of 
instantaneous action at a distance. 


54. A Comparison with the Statement of the Problem in Newtonian 
Theory. Boundary Conditions 


In Newton’s theory of gravitation the gravitational potential satisfies the 
equation 


AU = — 4zye (54.01) 
and tends to zero at infinity in such a way that 

lim rU = yM (54.02) 

T—> 0 


where M is the total mass of all the bodies of the system in question and is 
equal to 


M= e dx dy dz (54.03) 

Einstein’s theory, which is based on the gravitational equations 
Re — kg/R = — xT (54.04) 
must, in first approximation, give the same result as Newton’s theory. Newton- 


ian theory is applicable to such mass distributions for which the total mass, 


“+ When we derived that law from Maxwell’ 
to be Euclidean. But, according to a remar 
be obtained without this assu mption, starti 
equations given in Section 46. 


s equations in Section 3, we assumed space-time 
k at the end of Appendix E, the same result can 
ng from the generally covariant form of Maxwell's 
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given by the integral (54.03) taken over all space, remains finite. This condition 
is in particular satisfied by any mass distribution of insular character. We 
use this term to describe the case that all the masses of the system studied 
are concentrated within some finite volume which is separated by very great 
distances from all other masses not forming part of the system. When these 
other masses are sufficiently far away one can neglect their influence on the 
given system of masses, which then may be treated as isolated. 

In formulating Einstein’s theory we shall likewise start from the assumption 
that the mass distribution is insular. This assumption makes it possible to 
impose definite limiting conditions at infinity as for Newtonian theory, and so 
makes the mathematical problem a determined one. 

Theoretically, other assumptions are also admissible. For instance, one can 
assume a mass distribution which on the average is uniform throughout space. 
Such a point of view is appropriate to the study of distances so enormous that 
in comparison even the distances between galaxies are taken to be very small. 
Very little is known of the mass distribution over such great distances and 
therefore a theory dealing with them will necessarily be less reliable and less 
capable of experimental verification than the theory of smaller scale astro- 
nomical phenomena. 

The bulk of this book will be devoted to the case of insular distributions of 
masses. The assumption of uniform distribution will be considered only in 
Sections 94 and 95, where we give the theory of Friedmann-Lobachevsky space 
to which this assumption leads. 

We shall thus now assume that space-time is in the main Euclidean, or 
rather pseudo-Euclidean, and that any deviation of space-time geometry from 
Euclidean geometry is a result of the presence of a gravitational field. Where- 
ever there is no gravitational field, geometry must be Euclidean. For an insular 
distribution of masses the gravitational field must tend to zero at infinity and 
therefore we have to assume that at points far removed from the masses the 
geometry of space-time becomes Euclidean. However, when geometry is 
Euclidean there exist Galilean coordinates x, y, z and ¢, in terms of which the 
square of the interval has the form 

ds? = c? dt® — (dx® + dy? + dz?) (54.05) 
Since experiment shows that the geometry of space-time nowhere deviates 
greatly from Euclidean geometry one may expect that there should exist in 
the whole of space variables in terms of which the square of the interval deviates 
but little from (54.05). In the following we shall give a more precise definition 
of these quasi-Galilean coordinates. 

We note that Newton’s theory is simplest to formulate in just these Galilean 
coordinates, i.e. in an inertial frame of reference. Consequently Einstein’s 
theory, which is its generalization, should be compared with it in terms of 
coordinates with as similar properties as possible. 

Newtonian theory is non-relativistic and in passing from a relativistic theory 
to a non-relativistic one, it is essential to single out the speed of light as a large 
parameter. Therefore we shall no longer introduce the quantity c into the 
definition of the time-coordinate ; instead of (35.03) we shall now write 


w= hs 2, =2; C=: Bq =z (54.06) 
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Thus henceforth the variable z, will mean simply the time ¢ and not ct as 


previously. 
The expression (54.05) for the square of the interval now appears as 


ds? — eda? — (da? + daz + da5) (54.07) 
This must be valid at sufficiently large distance from the masses, where the 


geometry is Euclidean. 
Comparing with the general expression 


ds? = g,,,dx,dax, (54.08) 
we find that the g,,, must have the following limiting values at infinity 
(Joo)c = c? ; (Joi)o =a 0 (54.09) 


(git)o = —Sxe (i, = 1, 2, 3) 
The corresponding limiting values of the contravariant components of the 
metric tensor will be 


(9°")oo = 3 (Go = 0; (goo = — Sux (54.10) 


These are then to be considered as the boundary conditions on the metric 
tensor. 

However, the number of boundary conditions so far stated is insufficient ; 
some additional ones must be added which characterize the asymptotic behaviour 
of the differences g,,, — (9,y)oo at large distances from the masses. 

In the previous section we saw that, at least if IY = 0, Hinstein’s equations 
are of the type of the wave equation, because their main terms involve the 
d’Alembert operator. Outside the mass distribution the tensor 7*” vanishes 
and the equations take on the form 


RY =0 (54.11) 
where, provided I'Y = 0, the tensor R*Y has the form 


Rw = — } g* 


o*gty 
ea (54.12) 


ra) Lg 


We assume that at large distances the differences g4” — (g*”),. and their first 
and second derivatives tend to zero as l/r, where r = 4/(2? + 22 4 22). (This 
assumption will be justified in the following.) Then at large distances the 
second term of (54.12), being a homogeneous quadratic form in the first deriva- 
tives, will tend to zero as 1/r?. As for the term involving the d’Alembertian, 
the coefficients in it can be replaced by their limiting values to the same 
approximation. After these simplifications we get 


Rwy — 1 oe (S argu er) 


28 og  B\ Gok * “aay Oak 


Y (54.13) 
A more, complete investigation of the asymptotic behaviour of the g"” will 
be given in Section 87. It shows that the asymptotic form of g*’ is influenced 
ae terms of order 1/r? omitted from (54.13) but that qualitatively the 
ehaviour of the difference g” — (9%), will be the same as the behaviour of a 
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function t) satisfymg the wave equation 
1 a%y 
—_— —AJ= ‘ 
c? at? ae bats) 
where A is the usual, Euckdean Laplace operator. 
We are interested in solutions of the wave equation (54.14) which correspond 
to outgoing waves dying off at infinity. They have the asymptotic form 


= “i(' af “n) (54.15) 
where n is a unit vector with the components 


a 
Nz = - 
r 


y z 

; Ny ==; nz=- (54.16) 
r r 

and f is an arbitrary function. The function f and its derivatives with respect 

to all its arguments are assumed finite. The argument n gives the dependence 

of f on the direction along which a point recedes to infinity. 

Other possible solutions of the wave equations must be discarded for physical 
reasons, for in our statement of the problem the system is considered to be 
isolated. This means that no waves impinge on it from outside, all waves have 
bodies of the system as their sources and, since in a system of insular type all 
bodies are concentrated in a finite region, all waves originate in this region 
and so have the asymptotic form (54.15) at large distances from the region. 

The conditions that a solution of the wave equations should at large distances 
have the form indicated can be stated in the differential form 

; (rp) 1 arp) 
lim (= +22) oat (64.17) 

This condition must hold for r—> oo and all values of tj; =¢-+7/c in an 
arbitrary fixed interval. It can be called the condition of outward radiation. 
It ensures the uniqueness of the solution provided it is associated with the 
requirement that the function y and its first derivatives with respect to z, y, z 
and ¢ should be everywhere bounded and should die off at infinity as 1/r (see 
Section 92). 

We note that the above considerations refer strictly speaking to the ordinary 
wave equation (54.14) and not to Einstein’s equations. Therefore the asymptotic 
form of the difference 


gh — (9a = (54.18) 
will, in fact, differ somewhat from (54.15). However, a slightly modified condi- 
tion of outward radiation written in the differential form (54.17) will still be 


valid for (54.18). 
Summing up, we can say that in our statement of the problem the metric 
tensor must satisfy the condition of being Euclidean at infinity and also the 


condition of outward radiation. 


55. Solution of Einstein’s Gravitational Equations in First Approxi- 
mation and Determination of the Constant 


To compare the gravitational theories of Einstein and of Newton we must 
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first of all determine the constant x which enters Einstein’s gravitational 
equations 

RY — tq"R = — xT (55.01) 
The value of this constant can be found by comparing the expression for the 
square of the interval derived in Newtonian approximation in Section 51 with 
that obtainable by approximately solving Eimstein’s equations. 

For the mass tensor on the right-hand side of (55.01) we may take the 
approximate expression corresponding to a Euclidean metric that was dis- 
cussed in Section 32, There the mass term was given explicitly for the case 
of an elastic body. In taking over the equations of that section we must re- 
member that we now understand z, to mean simply the time ¢ and not ct as 
previously. Therefore the previous 7 will in the new notation be equal to 
c?T and the previous T™ to the new cZ™, the meaning of T** being unaltered. 
Thus if x) =¢ the equations (32.34) become 

oD = p + S (200? + ell) 


1 : 
c?T% = py, + a {v4 (pv? + pl) = pikrK} (55.02) 
=] 
of [tk = pure — pir 
In the present approximation we must disregard the terms corresponding to the 


density and current of energy—Umov’s scalar and vector—and we write simply 
C77 = 54 o2T% = py, (55.03) 


To the same accuracy to which this is valid we may replace the invariant of 
the mass tensor by the value 


T=e (55.04) 
Equations (55.03) and (55.04) enable us to calculate the approximate values 


of the tensor components entering the right-hand side of Einstein’s equations 
written in the form 


Rw — — x(T’ — 4gT) (55.05) 
which was given in (53.03). Using the Galilean values of the g*’ we get 


1 
T% — Lgoop — ve 0 


; 1 
T% — Aoup — =a 0M (55.06) 
TH — LgtkT — LoSy 
On the other hand, according to (54.13), if we use harmonic coordinates 
we have approximately, 
1 o*guy 


Rw — Aq — 
bAg 2c? at? 


(55.07) 


where A is the usual Euclidean Laplace operator. As we shall be interested in 
& quasi-static solution we can discard the term involving the second time 
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derivative. Inserting (55.07) and (55.06) into (55.05) we have 


x 
Ag = — a ° 
. 2x 
Ag* = — 3 eu% (55.08) 
Ag*.= —xodix 


We now refer back to the expression for the squared interval in Newtonian 
approximation. According to (51.10) we have then 


Joo = c? — 2U (55.09) 


where U is the Newtonian potential. In this approximation the remaining 
components of the metric tensor are to be replaced by their Galilean values. 
Using the relation 


3 
Gog"? + > Gog” = 1 (55.10) 
it 


and the fact that the products g,,9' are very small compared to unityt we can 
take 


Joog* = 1 (55.11) 
and therefore 
1 20 
al ar (55.12) 
But Newton’s potential satisfies the equation 
AU = — 4rye (55.13) 
hence 
Sry 
Ag” == ow e (55.14) 


Comparing this with the first equation in (55.08) we see that the two are 
coincident, if Einstein’s gravitational constant x is related to Newton’s constant 


y by the relation 
sas OY (55.15) 


The Newtonian potential U is that solution of (55.13) which satisfies the 
appropriate boundary conditions at infinity. As is well-known that solution 
can be written in the form of a volume integral : 


= eee da’ dy’ dz’ 55.16 
Ua=1| pipe (66.16) 

Side by side with this Newtonian potential we introduce the functions 

(er) , , f 
= dx’ dy’ d 55.17 
Dem | pee ey (66.17) 
which satisfy 

AU; = — 4rryo% (55.18) 


ft An estimate of these terms will be given later. 
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and also the conditions at infinity. In analogy with the corresponding electro- 
magnetic quantities these functions may be called gravitational vector poten- 
tials. Now the solution of (55.08) can be written as 


4 
gi = 0, (55.19) 


We have used (55.15) to eliminate x. 

We note that U has the dimensions of a velocity squared and the U; the 
dimensions of a velocity cubed. In estimating the orders of magnitude of 
quantities we can take U to be of the order g? and the U; of the order q°, where 
q is some speed much smaller than the speed of light. 

It is now purely a matter of algebra to obtain from the contravariant com- 
ponents of the metric tensor its covariant components, its determinant, etc. 

To simplify the algebraic manipulations we introduce a system of quantities 


eer ae FoFor . atk — — gtk (55.20) 
Joo 
where 7, k = 1, 2,3. It is easy to verify that 
3 
> Oza = Bi (55.21) 
m=) 


The set of quantities a, may be interpreted as a three-dimensional spatial 
metric tensor, but only its algebraic properties are of importance to us here. 
If we put 


a = Det Qik (55.22) 
and therefore 
1 
repeat (55.28) 
a 
we get 
J = — Yoo (55.24) 
It follows directly from the definitions (55.20) that 
3 
Joog™ = > amkg ig (55.25) 
m=) 
and also 
3 
Fio = Joo D atxg (55.26 
v2 ) 


If the g#” have the values (55.19) it follows that 


2U 
atk = (1 = =) (55.27) 
and therefore 
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2U 
atk = (1 ay JB (55.28) 
c 
Noting the form of go) we get 
Jootik = COE (55.29) 


with an error of order higher than U/c?. 
Hence, with the same relative error, we have 


Jig = 07g" (55.30) 
Using these results we obtain for the covariant components of the metric tensor 
Joo = c? — 2U 
4 
Joi = = Uy (55.31) 


Knowing the approximate values of the gy; and the g™ we can now verify the 
accuracy to which the use of (55.11) was justified. We have, approximately 


16< 
Jog” = 1 — => U} (55.32) 
i=1 


If the U, are of order g? the above expression differs from unity by quantities 
of order q®/c*. Therefore (55.11) may be used not only in this but also in the 
next higher approximation in U/c® or v?/c?. We note that (52.26) leads to 


3 
Joo9°” = 1 — Goo yy ang "g°* (55.33) 
k= 
Here gy is positive and the double sum is a positive definite quadratic form, 
therefore we shall always have, quite rigorously 
Gog"? < 1 (55.34) 


though, as we have seen, the deviation from unity of the left-hand side is 


exceedingly small. 
We now state the value of the determinant g and of «/(— g) times the contra- 
variant components of the metric tensor. We shall write 


gt’ = 4/(—g)-g"” (55.35) 
We then have 
—g=ct+4U (55.36) 
and therefore 
2U 
V(-9) = 6 + (55.37) 
Hence, using (55.35) 
sees 1 " 4U 
aa c 3 
; 4 
g% = Ue (55.38) 
gtk = — Bix 


We must now estimate the magnitude of the neglected terms in RY’ which 
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ic 1 | ivatives. 
are quadratic in the first deriva 
These terms are of the form I.*I¥,. To calculate approximate values of 


the Christoffel symbols we could use the approximate form of the metric tensor 
that has just been derived. However, we shall not perform these calculations 
here as the quadratic terms will be determined in detail in Chapter VI where 
we shall solve the gravitational equations in the next approximation. Here 
we only need the order of magnitude of the quadratic terms. It turns out 
that the terms in R® and R% will be of sixth and those in Rk of fourth order 
in 1/c. In our present approximation these terms have no influence. 
It remains to verify whether the conditions which ensure that the coordinates 
are harmonic, ; ; 
= — —— : —{v(—- 9):‘9"} = 90 (55.39) 
= ea aa, (V 9)-9""} 


are satisfied to the approximation required. Let us first make it clear to what 

accuracy we require them to hold. If we do not omit the terms in I” in the 

expression (53.04) for R’ but instead retain them to the accuracy corresponding 

to the approximation (55.07) for the other terms, we obtain in place of (55.07) 
1 ag 1 gfe 


00 — 1 O04) 2 LS eee Me 
He ad Qc? a2 c® at 
1 age = 1 x) 
i— IAAgti_ ow Cf 55.40 
oe 2Ag 2c? at? a 2\ dx, cc? A ( ) 
1 oye L(t ae 
RST hgh = 
: Qc? at? 2\dr, G% 


In order that the previously neglected terms in I'Y should really be small com- 
pared to terms of the type Ag” which were retained, it is necessary that I 
should be of a higher order of smallness in ¢ than 1/ct and I¥ of a higher order 
than 1/c?. These conditions are indeed satisfied. For it is directly evident 
from (55.38) that the I“ will be of fourth order in 1/c. As for P°, the terms of 
fourth order in it are 


4(0U au, 
Pos eS eee 41 
als +>) eb) 
These must vanish. Therefore the equation 
wu a, 
meee —. = 0 - 
oe 2 > (55.42) 


must hold. As is evident from the definition of U and the U; (either by means 
of differential equations with boundary conditions or in terms of volume 
integrals) this equation is indeed satisfied as a consequence of the relation 


a () 
a m8 (55.43) 


t=1 


which expresses the law of mass conservation in Newtonian approximation. 
Thus the expressions just derived for the metric tensor satisfy to first approxi- 

mation not only the gravitational equations but also the “harmonic con- 

ditions”. In addition they obviously satisfy the boundary conditions at 


Principles of the Theory of Gravitation 203 


infinity. The expression for the square of the elementary interval corresponding 
to the expressions derived is 


; 2U 
ds? = (c2 — 2U) dt? — (1 4+ (aah + dz + dzz) 
8 
+5 (Uda, + Udo, + Usdes) dt (55.44) 


Usually the terms involving the products da; dt are of no importance. Omit- 
ting them we get the expression 


ou 
ds? = (c2 — 2U) di? — (1 + =) (ae (55.45) 


which involves only the Newtonian potential. This expression has already 
been quoted without proof in Section 51, equation (51.11). 


56. The Gravitational Equations in the Static Case and Conformal 
Space 
The metric tensor is called static if its components do not depend on the 
time coordinate zy = t, so that ° 


09,, 
ay — 0 . 
Ai (56.01) 


and if, in addition, 
Jo = 0 (¢ = 1, 2, 3) (56.02) 

It is evident from physical considerations that if several masses are present 
they must be in motion}. Therefore a static metric tensor can only occur in 
the case of a single mass. In spite of limited applicability, the static case is of 
some physical interest, first of all because it permits a deeper insight into the 
analogy with the Newtonian theory of gravitation (which is also a static theory) 
and also because in the static case it is easy to discuss the question of the 
uniqueness of the solution. Also, rigouros solutions of Einstein’s equations can 
be found in this case. 

Under the conditions (56.01) and (56.02) the time coordinate is determined 
uniquely, while the space coordinates permit a group of transformations among 
themselves. Therefore it is natural, in this problem, to use the apparatus of 
three-dimensional tensor analysis and to write the gravitational equations 
accordingly. Three-dimensional tensor analysis can be applied either directly 
to the spatial part of ds?, or else to this spatial part multiplied by some factor. 
Remembering the approximate form of (55.29) obtained from Einstein’s 
equations, we introduce into the spatial part a factor inversely proportional to 
the factor in the time part, putting 


1 
ds? = c?V? dt? — pe hey day dy, (56.03) 
We shall consider the quantity V? to be a three-dimensional scalar and the 
quadratic form 


da? = hix dx dry (56.04) 


¢ The problem of the motion of a system of masses is considered in detail in Chapter VI. 
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to be the squared length of arc in a certain auxiliary space, which we shall call 
conformal space. Three-dimensional tensor analysis will be applied to this 
conformal space. As may be seen by comparing (56.03) with (55.45), in the 
approximation in which the latter holds we can assume the conformal space 
to be Euclidean and the quantity V? to be related to the Newtonian potential U. 
Thus we shall have 
Jo = CV", Ju = 9, 


hex (56.05) 
Gik ay ae ye’ 
and also 
1 
00 _ i 0, 
oS eye 9 (56.06) 
gik = —V2hik 
Here the quantities hj, and h** are connected by 
hayhki = hk (AE = 8*) (56.07) 


this relation bemg analogous to (37.18) for the g,,. Denoting by h the deter- 
minant of the hix we easily obtain 


V(-9) = a VA (56.08) 


Therefore we have 


— eV4 (56.09) 


and the d’Alembert operator (41.11) applied to a function i may be written as 


1 a : 
Lp = ape ae 7 VAWs (56.10) 


where (A), denotes the Laplace operator in the conformal space : 


Vh Ox; 


Hence we see that spatial coordinates that are harmonic in the four-dimensional 
sense will also be harmonic in the three-dimensional conformal space. 

We denote the four-dimensional Christoffel symbols (formed from the metric 
tensor g,,,) by (I,), and the three-dimensional Christoffel symbols (formed 
from the metric tensor hix) by (Vix)n. Similarly we shall attach suffixes 9 
and A respectively to quantities that are tensors in relation to the metrics 
(un) and (hex). 

The four-dimensional Christoffel symbols and the curvature tensor can be 
expressed in terms of the corresponding three-dimensional quantities. 

These expressions will involve derivatives of the three-dimensional scalar V, 


1 @ a 
(Av) = hig (Ve) (56.11) 
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which we will denote by ue 


av 
(V4), = AEV;, (56.12) 


k= 
: On, k 


The suffix h attached to the V# will sometimes be omitted for brevity. 
The Christoffel symbols with purely spatial indices will be 
Ce) Vi a Vi 


ey | ile 
ye ey ey 


(Ving = (Vig), + hax (56.13) 


If one or all three indices are zero, the Christoffel symbols become zero: 
(™),=0, (),=90, (Fh), =0 (56.14) 


Finally, if two of the indices are zero we get 


y 
Tie = VV; (Che = = (56.15) 
Using the general formula 
are are, 
e — on Te _ [Te p 
Go, RV ex, 0x, GL ay av re. (56 16) 


we can express the four-dimensional curvature tensor in terms of a three- 
dimensional tensor and the covariant derivatives of the three-dimensional 
scalar V. Leaving out elementary, though rather tedious, calculations we obtain 
for the components with four spatial indices 

= (V4, Yn 


(Ri mea = (Ri, mk)h 7 ieee V (Vi, )r —hrs V 


aay La (Ks )n 


where (Vix), is the second covariant derivative of V with respect to the 
metric (h): 


(Vix)n amg (himh', = high! ee (ViVA)n, (56.17) 


WT yp? 


eV ji. OV 
= —— — — 56.18 
(Vik)n BaieE (Vix)r on; ( ) 


and (V4), is the same derivative with a suffix raised: 


(Vi), = he Vin)y (56.19) 
If only one index is zero, we have 
(Remedy = 9  (Rimeg =  (Rionde = 0 (56.20) 
lf two indices are zero we get 
( Dankele =0 (56.21) 
and also 
] 2 
(Reowle = Vien + aa VeVi — Pew’ a (VV), (56.22) 
and finally 


(Rb og = VAVVE, + 2V'V, — WV, Vb (56.23) 
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It is now simple to form the contracted curvature tensor which enters 


Einstein’s gravitational equations. 
Using the formula 


(Rinlg = (Rrmela + (Rioxdg (56.24) 
we obtain from (56.17) and (56.22) 
1 2 1 
(Riz)g = (Riz)n — hie (AV), + fi ViVi + his (V;V4)_ (56.25) 
where (AV), is the Laplace operator applied to V: 
1 @ ov 
= (VP), = = [VA 56.26 
AM = Von Sh bx (v ina) nee 
On the other hand the formula 
(Roo), = (Re x0)g eee (Ro ong (56.27) 
gives, using (56.23), 
(Roo)g = —c?V2{V-(AV)n — (ViV4)n} (56.28) 
As for the mixed components of R,,,, they vanish as a consequence of (56.20): 
(Ro:), = 0 (56.29) 


By virtue of the relations (56.06) between the four- and three-dimensional 
metric tensors, equations (56.25), (56.28) and (56.29) lead to the following 
expression for the invariant (R),: 


(R)g = —V>{R)n + 2V(AV), — 4(Vi VA), (56.30) 
We denote by G,, the conservative Einstein tensor 
Guy a (Bis) a 39,(2), (56.31) 
and by Hi, the conservative tensor in the conformal space 
Hu = (Rix)n — dha R)r (56.32) 
The invariant of the latter is 
H = htkH,, = —4(R), (56.33) 


as a result of which we have 


(Rize = Hix — hye (56.34) 
For the spatial part of the conservative Einstein tensor we obtain the simple 
expression 
2 
pi 
which is remarkable for the fact that it does not contain second derivatives of 
the three-dimensional scalar V. 

The mixed components of the conservative tensor vanish, 


1 
Gik = Hix +— VV, — hina (V;V)n (56.35) 


Gy = 0 (56.36) 
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while the component G'gy is given by 
Goo = €?V2{—V2H — 2V(AV)», + 3(V4V4)n} (56.37) 


We go over to the formulation of the gravitational equations. We have just 
noted that the quantities Gi, do not contain second derivatives of V. On the 
other hand, (56.28) shows that Ry) does not involve second derivatives of the 
hix. Therefore, if we write down the gravitational equations in a form solved 
with respect to Ro, and Gi, the second derivatives of V and of hiz will be 
separated from each other. By virtue of the general equations 


R,, = —X(T,, — 39.7) (56.38) 
which are the covariant form of (53.03), we have 
Roo = —X(T 0 — 49007) (56.39) 
where T is the invariant 
T=(T),=(T3+ Ti 472+ 1%), (56.40) 
Hence by virtue of (56.05) we get 
Rog = —3xe?V(T) — T} — T;— T)), (56.41) 
and, by using the value (56.28) of Rog: 
V-(AV)n — (ViVI)n = 3x(T) — Ti — T? — T3), (56.42) 
The equations for the spatial components are 
Hix + oi ViVe — heer ( ViVI\n = —xT tx (56.43) 


As regards the equations for the mixed components, 
Gig = —uT in (56.44) 


they are satisfied identically, because here the left-hand side is zero by virtue 
of (56.36) and the right-hand side is also zero, because the mass current is zero. 

The equations so obtained acquire a more pictorial form if one introduces 
some new symbols. 


We put 
] 
p= 53 (Po — Ti —T2 — Ta)o (56.45) 


The quantity » can also be written in the form 


w= C7700 4 AikT i, (56.45’) 
As may be seen by comparing this with the approximate expression (55.02) 
the quantity u represents a certain mass density; we shall see that it can be 
interpreted as the mass density in the conformal space. 
Further, we replace V by the quantity ¢, according to the formula 


V =e? (56.46) 
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so that the relation between the space-time metric and the metric in the con- 
formal space takes on the form 


ds? = c®e-%* dt? — e®* do?® (56.47) 
we have 
a V 
OD d= — se 
x4 V 
7 (56.48) 
AV V;Vi 
a ees ial 23 
Therefore, the gravitational equations (56.42) and (56.43) may be written as 
(Ad)n = —} (56.49) 
Ain = —2dibx Si hil Bib) — “Tix (56.50) 


The first of these equations is essentially Poisson’s equation for the New- 
tonian potential U. Indeed, the symbol A is a generalization of the Laplace 
operator, » is the mass density and, by (55.15), the constant x is given by 


oa a (56.51) 
Therefore if we put 
= _ (56.52) 
ce 
the quantity U will satisfy the equation 
(AU), = —4rryu (56.53) 


which differs but slightly from equation (55.13) for the Newtonian potential. 
We can also put 


g=% (56.54) 


where g; is a component of the gravitational acceleration. 
Let us now clarify the physical meaning of (56.50). Apart from a factor the 
terms involving the ¢; can be interpreted as gravitational stresses. If we put 


2 die — hi Pih)» = “Thy (56.55) 
we can replace (56.50) by 
Hy, = —(TR, + T,,) (56.56) 
The three-dimensional divergence of the tensor Tix, understood as referring to 
the metric (Aix) is 
2 
(ViT FR) = = fi(Ad)n (56.57) 
and by (56.49) we have 
(VETTE), — Udi (56.58) 


: He the other hand, since Hi is a conservative tensor in the conformal space, 
its ee ee 
ivergence is zero. Therefore, apart from its sign, the divergence of the 
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gravitational stress tensor is equal to the divergence of the tensor of elastic 
and other static stresses T;,. Thus we have 


(VET sx)n = di (56.59) 


These equations represent a generalization of the usual equations in the statics 
of elastic bodies in a gravitational field. 

The equations for the statics in conformal space, written in the form (56.56) 
stand in analogy to Einstein’s equations in space-time. In both sets of equations 
the left-hand side involves a conservative tensor, while on the right there is a 
stress tensor or its four-dimensional generalization. Here the gravitational 
stresses appear in explicit form only after space has been split off from time 
and after passage to the conformal space. 

The conformal space will be almost Euclidean. Indeed, as is seen from (56.54) 
and from the estimates (55.02) for the tensor T;, the right-hand side of (56.56) 
will be of the order g?/c?. This leads to the result that the deviation of the hix 
from their Euclidean values will be of the order U*/c*. This result is in agree- 
ment with the approximate formula (55.45), which was just the basis for 
introducing the conformal space. 

For empty space, when T,, = 0 and pw == 0, equation (56.49) is a consequence 
of (56.50). It is easy to see this by equating the divergence of Hix to zero and 
using (56.57). 

If the mass tensor T,,, is zero in the whole of space, the only static solution 
of Einstein’s equations which has no singular points and which satisfies the 
boundary conditions will be the solution corresponding to Euclidean space and 
pseudo-Euclidean space-time. This can be shown in the following manner. 
In the case of empty space, equation (56.49) gives (Ad), = 0. Thisis an equation 
of the elliptic type for 4, which represents a generalization of Laplace’s equation. 
The function ¢ and its derivatives ¢; must be everywhere finite and continuous 
and at spatial infinity they must tend to zero. But the only solution of Laplace’s 
equation that satisfies these conditions is the solution ¢=0. But then the 
derivatives ¢; will also vanish and therefore also expression (56.55). Since in 
addition T;, = 0 we also have Hi, = 0. Hence it follows that the curvature 


tensor of the conformal space is zero, and the space itself Euclidean (see 
Appendix G). 


57. Rigorous Solution of the Gravitational Equations for a Single 
Concentrated Mass 
In the case of a concentrated mass a rigorous spherically symmetric solution 
of the gravitational equations can be found. As we are dealing with a static 
case we can use the results of the foregoing section, and write ds? as 


1 
ds? = c®V2 dt? — Wi do? (57.01) 
do? = hix day dx, (57.02) 


[f 2,, Z, and 2, are harmonic coordinates we can introduce spherical coordinates 
‘elated to them by putting 
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Z, = 7* sin} cos 

Zo = 7* sin} sin Y (57.03) 

Ly = 1* cos 9 
The assumption of spherical symmetry implies that the expression for do® is 
of the form 

do? = F? dr*? + 9%(d9? + sin? dp’) (57.04) 

where F and p are functions of r* only. The coefficient V must also be taken 


to depend only on 7*. 
We note first of all that if we put 


F dr* = dr (57.05) 
we can reduce the general expression (57.04) to the case F = 1, so that 
do® = dr? + 9*%(d9* + sin? 9 dp?) (57.06) 


It is true that in doing this it may happen that the radius-vector 7 will be 
““non-harmonic ’’, in the sense that it is not related to the harmonic coordinates, 
Zy, L2 and zz by equations of the form (57.03). But having formed Laplace’s 
equation for the quantities (57.03) with r* replaced by 7 one can always go over 
afterwards to a “ harmonic ” radius vector 7*. 

For the metric (57.06) we get 


(57.07 
hy, = 0, h,, = 9, hi, = 0 
and therefore 
Arr = [. },>> — se. her — : ; 
e° pe” sin? $ (57.08) 
hs? —0, her —0O, hr? — 0 
Hence 
Vh = p? sin 9 (57.09) 
and the Laplace operator in the conformal space may be written as 
1 @ a ] 
where A*¢ is the Laplace operator on a sphere: 
a 1 ay 
Ap = ——— — ens ae 
Yaa 36 (sins =) sin? 9 d9° erry) 


As a consequence of (56.10) the harmonic coordinates must satisfy Laplace’s 
equation in conformal space. For the quantities (57.03) we have 


Ata, = —2x,; (57.12) 
Therefore the condition for harmonic coordinates 


Ax =0 (57.13) 
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reduces to the form 
d (ey _ ar* 


This is the equation to be used in passing from the initial radius vector r to the 
harmonic one, r*. 

By applying the general formulae to the metric tensor (57.07) and (57.08) 
the following expressions for the 18 Christoffel symbols can be derived: 


ly, = 0, re =0, re, =0, 
Mho=- er) Thao, 0, 
Ve = — pp’ sin? 9, = — sin $ cos $, re, = 0, 
p (57.15 
Drs = 0, Tey = = Pe, = 0, ) 
°” 
Fie” Pr, = 0 eg 
bp = 0 T3, = 0, Pg, = cotd 


Here the prime denotes differentiation with respect to r. The Christoffel symbols 
and all tensor quantities used in this section refer only to the conformal space; 
therefore there is no necessity to attach a suffix, as was done in Section 56. 
Using the Christoffel symbols tabulated in (57.15) we form the three- 
dimensional fourth rank curvature tensor and then, using the equations 


Rer = Ro, 9 + RP, oe 3 Ry = RF, a9; 


Ryg = Rh, -9 + RB, oo? Ry» = R?, Bq? (57.16) 
Re. = FE eg Rs be Roy = RB, re 


the second rank curvature tensor in the conformal space. In the equations for 
the non-diagonal components we have omitted terms in which the first lower 
index is equal to the upper index: owing to the symmetry properties of the 
fourth rank curvature tensor these terms vanish if the coordinate system is 
orthogonal. In the general formula (56.16) we leave only those terms which are 
different from zero and so obtain 


Roo = 


ar, 
a +7203, (57.17) 


and after inserting the values of the Christoffel symbols from (57.15) 


“ 


R= (57.18) 
p 
The calculation shows that Rf,,, has the same value: 
Re, op = 5 (57.19) 


Therefore 


ae (57.20) 
e 
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Further, we have 


ars, 
b= oe + TT be (57.21) 
whence 
RS, 79 = PP (57.22) 
Continuing the calculation, we get 
aa 2 1 Pale 57.23 
RS, eo — a9: aP $l 8. = Vs re ( : ) 
whence 
RS, 9 = —1 + ¢? (57.24) 
Inserting (57.22) and (57.24) into (57.16) we find for Ryg the expression 
Rog = pp” + pe? — 1 (57.25) 
Similar calculations give 
Rog = Sin? OR ygs (57.26) 


as was to be expected for reasons of spherical symmetry. The non-diagonal 
elements of the second rank curvature tensor prove to be zero: 


Ry=0, R,_=0, Ry, =0 (57.27) 


The invariant of the three-dimensional] curvature tensor can be calculated from 
the formula 


2 
R = Rry + fo Ryo (57.28) 
and will be given by 
2 
R= = Qpp" + 1) (57.29) 
Applying equation (56.32) we get the following simple expression for the 
conservative tensor of the conformal space 


Le 
Hr = ane Hyp = —pp", Hy, = —pp” sin? 9, 


Cre) a 
H,y=0, H,y=0, Hy, =0 


(57.30) 


We could have obtained these expressions by a somewhat simpler method using 
the relation which, in three-dimensional space, connects the covariant fourth 
rank tensor and the conservative tensor. This relation is discussed in Appen- 
dix G. In the notation of this section equations (G.13) of the Appendix may 


be written in the form 
ht — Ryo, 99° hH** — R 


THs ro . 
hH?? — sg rae hHer — R,», oe hH™® — R 


oe deo or 


hH?? = R 


rh, rO } 


(57.31) 


It is easy to see that these formulae lead to expressions (57.30) as previously 
found. 


The formulae we have derived allow us to write down Einstein’s gravita- 
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tional equations in explicit form. In the previous section we saw that if one 
writes ds? in the form 


ds? = c®e-% dt? — e2¢ dg? (57.32) 

where do® has the value (57.04), the gravitational equations appear as 
Ad = —hup (57.33) 
Hix = —2 bide + hurl did) — xT ix (57.34) 


> 


where the “mass density” p is given by (56.44) and (56.45). Going over to 
the present case of a mass concentrated at a point and using spherical co- 
ordinates, in which the Laplace operator has the form (57.10) while the quan- 
tities Hix are given by (57.30), we obtain 


ld 
A = 24! — 
b= 5G (78) =0 (57.35) 
T=6 
= o ever ss 
e (57.36) 


Hoy = —e9" = ph? 
The equation for H,, differs from that for Hy, only by the factor sin®9, while 
the remaining equations of (57.34) are satisfied identically. 


Integrating (57.35) we get 
erp’ = —a (57.37) 


where « is a constant. Since equation (57.35) is a limiting case of (57.33) with 
positive w, the constant « should be taken positive. Indeed, considering first 
(57.33) and putting 

an [ petdr=M (57.38) 


where the integral is extended over the whole region in which p differs from 
zero, we see that (57.37) holds everywhere outside this region, with « given by 


gal (57.39) 


Here y is the Newtonian gravitational constant, related to x by (56.51). Evi- 
dently M is the mass of the gravitating body; in going over to the case of a 
concentrated mass this quantity, and with it «, remain finite and positive. The 
dimensions of « are those of a length, which is why it is called the gravitational 
radius of the mass. 

Inserting the value of ¢’ from (57.37) into the first equation of (57.36) we 


obtain 
2 


et=145 (57.40) 
e 


and taking the square root so as to satisfy the requirement that we have p’—> + 1 
at infinity, we get 
op’ = (9? + a?) (57.41) 
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Differentiating this expression with respect to r we obtain 
oh ie aa (57.42) 


which shows that the second equation of (57.36) is also satisfied. . 
The differential equation (57.41) is casy to solve by quadrature; after setting 
the additive constant zero, we get 


r= /(p? + a) (57.43) 
whence 
0 == V(r? — a?) (57.44) 
Thus finally 
do? = dr? + (r? — «?)(d9? + sin? do?) (57.45) 


By its physical nature e must be positive and therefore the range of variation 
of r is 
r2>o (57.46) 


We must now discuss the harmonic condition. Inserting the value of o from 
(57.44) into (57.14) we sce that the harmonic radius vector r* satisfies the 
equation 


(r? — a) ee Iv* — 0 (57.47) 


Evidently this equation has the solution 
sg (57.48) 


It is easy to show that this is uniquely the solution which for finite r remains 
finite in the whole region (57.46) and which at infinity differs from r by not 
more than a finite quantity. Therefore the variable r which enters our formulae 


is itself the harmonic radius vector and in place of (57.03) we can simply write 


v1 = 7 sind cos 9, 


T= 7 sin} sin 9, (57.49) 
Lz = 7 cos §. 


It remains to find the quantity ¢. Integrating (57.37) and taking into account 
the boundary conditions we get from (57.44) 


adr 
= [= err: (57.50) 
T 
or 
1, rta 
py te (57.51) 
Hence 
Tr—oa 


Vix 
nie (57.52) 
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The expression (57.01) or (57 .32) for ds? takes on the form 


apie a TE es (57.53) 


rte r—« 


and, after inserting the value of do? from equation (57.45) we get 


r—o + 
ds? = ¢? (=) di? — (: *) dr? — (r + a)%(d92 + sin? d dp?) (57.54) 


a r—a 


The rigorous solution so obtained confirms our conclusion that the conformal 
space is almost Euclidean and that ¢ is approximately equal to U/c?, where U 
is the Newtonian potential for which we can put 
ates 


ra 


U (57.55) 
Indeed, equation (57.45) shows that the components of the metric tensor of 
dc* have relative deviations from their Euclidean values of the order 


a UF 
eS ae (57.56) 
and that the error of replacing ¢ by U/c? will be of the same order. As the 
estimate given in the following section will show, the quantity (57.56) is ex- 
tremely small. One should note that such close agreement with Newton’s 
theory is obtained only if harmonic coordinates are used. 

The solution of the problem of a concentrated mass in a form equivalent to 
(57.54), but in arbitrary non-harmonic coordinates, was first derived by 
Schwarzschild [18] and is often named after him. 


58. The Motion of the Perihelion of a Planet 

We have found a rigorous solution of the gravitational equations which may 
now be applied to the investigation of the gravitational fields of the Sun and 
the planets. 


We have 
Wet a(® 7 *) ae = (: a *) are — (r + «)*(d3* + sin? 3 dp?) 
r+a r—% 
(58.01) 
M 
where oe a (58.02) 


is the gravitational radius of the mass M. For the Sun, and even more so for 
the planets, the gravitational radius « is much smaller than the geometric radius 
L, which may be defined as the radius of a sphere of volume equal to that of 
the body. We can set up the following table: 


Sun Earth Moon 
a is 8 1-48 km 0-443 cm 0-0053 cm 
L Be 696,000 km 6,370 km 1,738 km 
a:L.. i = 2 x 10-8 7 x 190-10 3 x 10-2 


For super-dense stars o is of the same order as for the Sun while LZ, though 
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smaller than for the Sun is not more than a hundred times smaller. Because 
of the smallness of « : Z the metric of space-time is not very far from Euclidean 
even near and inside the mass. This is all the more true in the conformal space, 
where by equation (57.56) the deviations from Euclidean values will be of the 
order («/Z)?. In comparing ds? with its Galilean value one must remember that 
the coefficient of dt? has the large factor c?, while there is no such factor in the 
spatial part. Therefore, when the speeds involved are low, as in planetary 
motion, the deviation of go) from. a constant value shows up much more sensi- 
tively than do comparable relative deviations in the spatial part of the interval. 
Even in Newtonian approximation the Galilean value go) = c? must be replaced 
by the more accurate expression Jp) = c? — 2U. 

We shall now compare our rigorous solution of the gravitational equations 
with the approximate solution discussed in Section 55. To do this we must 
transform (58.01) from spherical to rectangular coordinates which must be 
harmonic. We can write the spatial part of (58.01) 


Ae 
dy? = = dr? + (r + «)*(d9? + sin? 9 de®) (58.03) 
in the form 
r+ta a? ce 
dl? = a as dr? + (: + *) (dr? + 72. d92 + r2 sin? 9 dp?) (58.04) 


In this last form it is simple to pass to rectangular coordinates. We get 


ds? = oc? * gy — 1 ae “Ud 7 + da? + dz? 
ee _) (det + dag + dag) 
rtaa? 
Soret (x, dx, + 2, dx_. + x%3dz3)? (58.05) 
a\* rta «a? 
whence IJik = -(1 + *) ou = ars : ar LiLE (58.06) 
r—a 
and also Soo = C2 ome re Io = 0 (58.07) 


If in (58.05) we neglect the square of the ratio a/r we obtain the approximate 
expression (55.45) with the Newtonian potential 
‘ 
U = c2 . = yM 
r r 
Equations (58.06) and (58.07) lead to the follow; ] i 
i wing value of the determinant 


(58.08) 


=— ca(1 + =" (58.09) 


H(- ) =1+- (58.10) 


satisfies a Laplace equation with Euclidea i 
, n coefficients. In th 
(Section 68) we shall see that quite generally the fourth roo n the next chapter 


We note that the quantity 


t of (— g/c?) approxi- 
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mately satisfies a d’Alembert equation with Euclidean coefficients. 
Using (58.06) and (58.07) again, or else transforming directly the equation 


ps ey 


0x, Ox, r—a c?\ a r+a/ \ér 
1 ab\ 2 1 a\ 2 
eam [(35) + ames (Z) | on 


to rectangular coordinates, we obtain the values of the contravariant com- 
ponents of the metric tensor. We give the expressions for these components 
multiplied by »/(— g). They are 


gtk = +/(— g)-gt = — Sip + ca? = (58.12) 
wal 3 
re gee = = os ot =0 (58.13) 


These relations allow one to verify readily that our coordinates are really 
harmonic and that 


CUS ai (58.14) 


Having found the gravitational potentials for the field of a concentrated 
mass we can determine the motion of a particle in the field with the aid of the 
assumption that it will move along a geodesic. We saw in Section 51 that this 
assumption is in agreement with Newtonian mechanics. A fuller justification 
of this law of the motion of a mass point will be given in Section 63, where it 
will be deduced from the gravitational equations. 

As we know the equations of a geodesic are cbtainable from the variational 
principle. 


8 J ds = 0 (58.15) 
which may also be written in the form 
3 { Lda=0 (58.16) 


Here L is the Lagrangian and in our case it is equal to the square root of the 


expression 


POPC Ons : ; 
a (242, + Lob, + Lyk)" 


(58.17) 


r—a 


r+ea 


[2—— 


a 2 
~(. +) +a - 


r—a 


in which the dots denote differentiation with respect to time. We thus have 
before us a simple problem in point mechanics. 

In order to solve this problem we note first that the Lagrangian is spherically 
symmetric. This means that it does not change when the sets of quantities 
(x,, Za, Z,) and (Z,, Zz, 43) undergo the same linear orthogonal transformation. 
A consequence of this, as always in mechanics (and see also Section 27), is 
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that there exist the integrals of the motion 


aL aL 
— Vea = C 
Lo ai 3 aa 1 
eL oL 
op ope ere) 
oL oL 
L,—— %y— =C 
‘a, "Ot, ° 


Consequently the trajectory of the particle lies in the plane 
C121 + Cot, + Cytg = 0 (58.19) 
It does not restrict the generality of the problem to choose this plane as one 
of the coordinate planes, so that 
t3=0; 2,=0 (58.20) 
We then have to deal only with motions in the plane, and plane polar coordinates 
are most convenient for this. Our previous spherical coordinates reduce to 
these if we put 
9=}n; $=0 (58.21) 
Rewriting the square of the Lagrangian in these polar coordinates we get 
r—a rt+a. 
—— 7 


r+e r—@ 


2 (r + a)%6? (58.22) 


The Lagrangian is independent of the time ¢ and also independent of the 
angle ». This immediately gives us two integrals : 


re »9— —-L= S . 
r a + © 36 const (58.23) 


and — = const. (58.24) 


which correspond to the usual integrals of energy and of angular momentum. 
Remembering that 


Ldt=ds=cdz (58.25) 
where t is the proper time, we can rewrite the integrals (58.23) and (58.24) as 
r—adt 
Tee i (58.26) 
d 
and aa ae 
(r + a) que (58.27) 


Here « and . are constants. The quantity u. can be interpreted as the angular 
momentum of a unit mass. If we put 


Eo 
ee] + a (58.28) 
where £, is a new constant, our equations show that in non-relativistic approxi- 
mation 
yM 
. 


143 
E,=tv 


(58.29) 
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so that E, is the total energy of a particle of unit mass. 
An algebraic consequence of (58.26) and (58.27) is the relation 


ae (58.30) 
It is derived by inserting (58.26) and (58.27) into the identity 


a (=) (z) - (=) (Zz) — (Poe a2) = c2 (58.31) 


The foregoing equations give us three first order differential equations for the 
quantities 7, p and ¢ as functions of t. The solution of these equations evidently 
reduces to quadratures. We shall not give the integrals explicitly but will 
confine ourselves to the discussion of the trajectory of the particle, i.e. of the 
dependence of r on 9. 

Eliminating d+ from (58.27) and (58.30) we obtain 


2 2.2 2 
(5) a Geet == 4 aye we Se) (6889) 
dp pe? pe? 

We have here on the right-hand side a polynomial of the fourth degree in r. 
Therefore » will be expressible in terms of 7 as an elliptic integral of the first 
kind and conversely r will be an elliptic function of ». The real period of this 
elliptic function will differ somewhat from 27 and therefore the orbit will not 
be a closed one. The polynomial on the right-hand side of (58.32) obviously 


has the one negative root r = — «; it also has a small positive root 
BaFc%e2 
rye . (58.33) 
U 


and two further roots r, and r,. If e? < 1 both these roots are positive and we 
shall always have r, <r <7,, i.e. the orbital motion is a finite one. If, on the 
other hand, e? > 1, one of the roots, say 7,, becomes negative ; we then have 
r, <7, and the orbit extends to infinity. If «2 = 1 we have rz, = oo. 

If we introduce the variable 


u=- (58.34) 
Tr 


in place of r and write the polynomial in an expanded form, we get 


(5) = c%(e* — 1) Be 2ac? (2e? — l)u Ae (=> aS 1) ut 
de pe pe e 


a (oa + 1) 


a a (Sa) 
U 


2 


We now estimate the orders of magnitude of the various terms in this expression. 
As in Section 55 we introduce a characteristic velocity g and a characteristic 
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length 1. Then we have, in orders of magnitude 


q? 
e—l 0S; ur ~ 124? 
Cc 
2 1 


From these estimates we can easily see that on the right-hand of (58.35) the 
terms involving the zero, first and second powers of a wu are of the order of 1/1? 
whereas the terms containing the third and fourth powers of u are of order 
(q/c*):1/22. If, therefore, we merely neglect some very small terms, namely 
those of order g*/c* (or «?/l?) compared to unity, we can drop the last two 
terms in (58.35) leaving the equation 


2n222 
(Fe) SE) TE ect yu (SEE aut 6880) 
dp p? . u 


The roots of this quadratic polynomial will correspond to the two roots referred 
to above as 7,, and r,. We put 
1 Le. 
Uy _— | ; 
at Pp 2 Pp 
where p and e are new constants related to our original constants ¢ and p. 
Approximately we have 


(58.37) 


(58.38) 
pe? = ac®p = yMp 
6a 
We also put ve= 1] — 7 (58.39) 
: 3a 
so that, approximately v=1— a (58.40) 
In this notation we can restate equation (58.36) as 
2 eee 7 
Ee 3) — peels +-—u—vw? (58.41) 
vi \do pep 
The solution of this equation is 
et 1 +e Cos vo (58.42) 


p 
Here the constant of integration has been so chosen that the largest value of 
u or the smallest value of the distance r corresponds to the value g=0. The 
expression (58.42) describes well the general nature of the motion. If v were 
equal to 1 we would have an ellipse, a parabola or a hyperbola of semi-latus 
rectum p and eccentricity e. We shall discuss the case of an ellipse, e <1. 
The radius vector r returns to its original value when the angle © increases 
not by 2m but by the somewhat greater value 2n/v. The difference 
ar 6rra 


Ag = — — 2x = — (58.43) 
v p 


gives the displacement of perihelion after one period of revolution of the planet. 
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The orbit of the planet may therefore be described as a precessing ellipse. 

One may note that Eimstein’s equations of motion of a planet reduce to the 
form of the classical equations of motion of a spherical pendulum ; therefore 
the trajectory of the planet has the same form as the trajectory of the bob of 
a pendulum.t 

For all planets the numerical value of qg' is exceedingly small. For the 
Earth, for instance, we can take p = 1-5 x 108 km and, using the value 
x = 15km, we get 

Ag = 6x X 10-8 = 0-038” 
in one revolution, i.e. one year, or in other words 3-8” per century. For Mercury 
the advance of perihelion per century is much greater, namely 43”, because 
in the first place it is considerably closer to the sun, having an orbit of a radius 
0-39 times that of the Earth, and also because its period of revolutions is 
shorter—420 revolutions per century. 

In comparing this theory with experiment it is important to remember 
that a motion of the perihelion is caused not only by the Einstein effect but also 
by the perturbing influence of other planets, by their deviation from spherical 
shape and so on. Such corrections are many times greater than those arising 
from Einstein’s theory. One must also bear in mind that the observation of 
the position of the perihelion is the more difficult the smaller the eccentricity 
é, i.e. the closer the orbit is to circular shape. For e = 0 the position of peri- 
helion is indeterminate. Nevertheless, astronomical methods of observation 
are so accurate and computational possibilities in celestial mechanics so great 
that in the case of Mercury a residual advance of the perihelion, unexplained 
by- Newtonian theory, is known to an accuracy of one second per century. 
It is 42-6” per century, in excellent agreement with the theory. For the Earth 
this residual advance of the perihelion is known with rather less accuracy to 
be about 4” ; this is also in full agreement with the Einstein value. 


59. The Defiection of a Light Ray Passing Near the Sun 


We now consider another observable consequence of Einstein’s theory of 
gravitation, the deflection of a ray of light passing close to the Sun. 

Before integrating the equation for a light ray let us first form a general 
picture of the propagation of light in the gravitational field of the Sun. 

We shall write the law for the propagation of the front of a light wave in 
the form 


ge —— — = 0 (59.01) 


where we have multiplied our previous equation by +/(—g). Using equations 
(58.10) and (58.11) we have, in spherical coordinates 


ober 3 ()*— (0 -2908)'— A [GRY ar N= 
(59.02) 


If we neglect terms of the order «?/r? in comparison with unity the equation 


{ See the diagram in A. N. Krylov’s book [19]. 
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for w reduces to 


2 2 
ie (3) — (grad w)? = 0 (59.03) 
c 
4 2 
where nt=lt—5 n=1l+— (59.04) 


This equation can be formally interpreted as the law of propagation of light 
in Euclidean space, but in a medium of refractive index n. 

We note that (59.03) could also have been obtained from the approximate 
expression 


2 
ds* = (c? — 2U)dt? — (1 + =) (da? + daz + dz3) (59.05) 
(see (55.45)) the effective refractive index then being 
2U 
m=1+ 5 (59.06) 


On the other hand, the approximate form (51.10) for ds? which was applicable 
for slow motions would lead to a form for the effective refractive index in which 
the coefficient of U is half the above. As we shall see below experiment confirms 
the expression (59.06) derived from the quadratic form (59.05). 

The fictitious medium of refractive index (59.06) is optically more dense 
in the vicinity of the Sun than it is far away from it. Therefore, light waves 
will bend around the Sun and light rays passing near the Sun will not be 
straight. We shall see that they will be describable as branches of hyperbolae 
with the Sun at one focus. The angle between the asymptotes of such a hyper- 
bola will determine the observed deflection of the ray. 

As we know from Section 38, a light ray represents a nul geodesic and its 
Hamilton-Jacobi equation is the equation of wave front propagation. (See 
also Appendix F.) Since in the previous scction we have already solved the 
problem of a geodesic of finite length we can obtain the equation of a ray from 
the results of Section 58 by means of a limiting process. Let us recall these 
results. We found the integrals of motion 


Akg 58.26 
rtak* een 
d 
and Be 
(7 + a) ‘i (58.27) 


and also the equation for the trajectory 

dr\2 2 ‘me 

aa) = a’ (7 + a4 — a (7 + a)(r — a) —(r +a)(r —«) (58.32) 
Since for a light ray dr = 0 the constants in (58.26) and (58.27) will be infinite, 
but their ratio 


f= ae (59.07) 


will be finite. Consequently, equation (58.32) will have the form 
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(=) _@ ; 
ae) Tt FOO bay ~a) (59.08) 


In place of the constant u, it is convenient to introduce another constant }, 
of the dimensions of a length, by the relation 


lim : =p,—o (59.09) 
Then (59.08) becomes 
dr\? 1 
ie =F: (r + a)* — (r+ a)(r — a) (59.10) 
and the corresponding relation for u = 1/r is then 
du\2 1 
ie = (1 + au)? — wu? + aut (59.11) 


If r and ¢ are interpreted as polar coordinates in a Euclidean plane the constant 
b just introduced will be the “‘ impact parameter ”’ i.e. the length of the per- 
pendicular from the origin to the asymptote of the trajectory. Indeed, elementary 
relations in plane Euclidean geometry give the following expression in polar 
coordinates for the perpendicular distance from the origin to the tangent toa 
curve 
r 1 


i a An A Ee 
V{l + (dr[de)?/r?} \/{u? + (du/de)?} 


The asymptote is the tangent at infinity (u = 0) and, therefore, the impact 
parameter is the value of d for u = 0. By (59.11) and (59.12) this is equal to b. 

Let us return to the equation for the path of the ray in the form (59.11). 
If u is taken to be of order 1/b the terms involving wu? and w‘ in this equation 
will be at least of order «2/6%in comparison with the main terms. Dropping these 
small terms we obtain an equation which can be solved by elementary means. 
The solution has the form 


(59.12) 


2a 1 
= Be + 5 cos © (59.13) 


U 
The constant of integration has been chosen so that the greatest value of 
u, and therefore the least of 7, corresponds to ¢ = 0. We find approximately 


=b — 2a (59.14) 


Tmin 
In the Euclidean (7, 9) plane equation (59.13) describes a hyperbola. The 
directions of the asymptotes of this hyperbola are determined from the con- 
dition u = 0 which gives 
2a 
cos 9 = —— (59.15) 
b 
Here the right-hand side is a small negative quantity, so that the limiting 
values of the angle wi!] be 
pare 8s o=—Ix—8 (59.16) 
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where the small positive quantity 8 can be put equal to 


at (59.17) 


b 


The angle by which the ray is deflected is the angle between the asymptotes 
of the hyperbola, which is 
28 = ae (59.18) 
b 
The displacement of the observed position of a star whose light passes close 


to the Sun can be observed during a total eclipse of the Sun. If 6 is put equal 
to the Sun’s radius the angle of deflection 28 takes on the value 


28 = 1-75" (59.19) 


which is in good agreement with observed values. An evaluation of the results 
of observations during the eclipse of 1952 gives a value of 1-70". This result 
allows one to assert quite definitely that the observations agree with the 
expression (59.05) for ds? and not with (51.10) which predicts 0-87”, half of the 
observed value. 

In conclusion we make a remark concerning the definition of a straight line 
in the theory of gravitation. How should a straight line be defined : as a light 
ray or as a straight line in that Euclidean space in which the harmonic co- 
ordinates x,, 2, and x, serve as cartesian coordinates? It seems to us that the 
only correct definition is the latter. We have, in fact, used it when we said that 
a light ray near the Sun has the form of the hyperbola (59.13). In the cases here 
considered the harmonic coordinates are deeply related to the nature of space 
and time and the definition of a straight line should be based on them. The 
consideration that a straight line defined as a light ray is more immediately 
observable, is of no significance. What is decisive in a definition is not immediate 
observability but a correspondence with Nature, even if this correspondence 
has to be established by indirect reasoning. 


60. A Variational Principle for the Equations of Gravitation 
In the equations of gravitation 


8 
RY —4gR = — pa T! (60.01) 


we have on the left-hand side the conservative tensor and on the right the 
mass tensor. In Section 48 we saw that the expression for the mass tensor 
can be obtained by varying the action integral with respect to the components 
of the metric tensor. Thus in the case of the equations of hydrodynamics the 
action integral could be written in the form 


S = | (e*c* + p*T)V/(— g)- (dx) (60-02) 


Here e* is the invariant density of that part of the rest-mass which does not 
change in the motion and which satisfies the equation of continuity (48.29) 
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and II is the elastic potential energy of the fluid per unit mass as defined by 
(48.30). If the integral S is varied with respect to the components of the metric 
tensor the result is 


a8 =F | D899) (en) (60.03) 


where TJ’ is the hydrodynamical mass tensor defined by (48.39). In the case 
of electrodynamics the action integral has the form (47.37) ; its main term is a 
function of the rest-mass and is the same as in the hydrodynamical case. The 
variation of the electrodynamic action integral with respect to the g,, again 
has the form (60.03) where now T¥’ is the electrodynamic mass tensor as given 
by (46.22) and (46.32). As regards variations of the action integral with respect 
to other quantities entering it, such as displacements and field components, 
we saw that they give the equations of motion of the material system in 
question. 

We shall assume that we have a system such that for it equation (60.03) is 
valid with a suitably chosen S. Then the mass tensor which enters the right-hand 
side of the gravitational equations can be represented as the coefficient of 8y,, 
in the expression for the variation of a certain integral. We shall now attempt 
to represent the left-hand side of-the gravitational equations, 1.e. the conservative 
tensor, in a similar manner. 

To do this we consider the expression 


l= i Ry/(— g)-(dz) ° (60.04) 


where F# is the curvature scalar and we perform the vamation of this integral. 
In evaluating the variation of the integral we use the fact that the variations 
of the Christoffel symbols form a tensor, although the Christoffel symbols 
themselves do not. This can be proved as follows. According to (42.04) the 
transformation law for Christoffel symbols has the form 
2x) = on, — _ (re, ax, ary 
Ox,da, »” aa, 
This relation is valid for the quantities Ie, related to the given metric (9,,). 
We perform a variation of the metric while retaining the connection between 
old and new coordinates. To the metric (g,, + 59,g) there will correspond the 
quantities re +68 Ts where 


— (60.05) 
ex, on, 


oa ox’ Ox’, 
oF (Se. eB 60.06 
sre, 2a, (81'3,) an, Be, (60.06) 


This proves that the variations 81, form a mixed third rank tensor. 
In forming the variation of the scalar R we start from the expressions 


ol or : 
a. = Be v om 
u, BV Br, ii a ge ier alta rel (60.07) 
and ; 
ae, a, 1, 
= Se need bel 60.08 
pv or, ax, + Bie bie RA ha ( 0 ) 


which give the mixed components of the fourth rank curvature tensor and 
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the second rank curvature tensor. In a coordinate system that is geodesic at a 
given point the variation of F,,, is 
a Bote 
shy = an, (81%) = an, (os) (60.09) 
because at this point all the Christoffel symbols vanish. Hence with a change 
of indices in the first term on the right 


a a 
g SR, = 9 = (8re,) — g” me (s1'3,) (60.10) 
We now introduce the vector 

. wt = gore, — gOPe, (60.11) 

It is easy to see that (60.10) is equivalent to the equation 

1 7) 
8k = ———— — {/(— 9) -w* (60.12) 
g8R,, WSs) a tv g) wu} 


since in a geodesic system one can take the quantities g"” = 4/(— g)-g*” out- 
side the sign of differentiation; but both sides of the equation (60.12) are 
scalars so that if it. holds in a geodesic system it must hold generally. 

We assume that not only the 8g,, but also their derivatives and therefore 
the 61%, vanish on the boundaries of the region of integration and that the 
vector w* also vanishes. Then, writing the integral J as 


T=] B,g9°/(— 9)-(dz) = [ B,,9"(d2) (60.13) 


we obtain for its variation the expression 


$I = | R,,8g"(dz) (60.14) 
since by (60.12) we have 
J 8R,.-9"V(— 9): (de) = 0 (60.15) 
Using the relations 
8V(— 9) = 8V(— 9) 9? 89a (60.16) 
and 
dg” = — tg? 89.4 (60.17) 
we get 


Sa” = 8{V(— 9) 9%} = V(— 9) (R998 — gg") (60.18) 
Inserting this expression into (60.14) and performing the summations over 
uw and v we obtain 


aL = | bg R — R*)89,,/(—9) (dz) (60.19) 


Our immediate obj ect has been achieved ; we have represented the conservative 
tensor as the coefficient of the variation with respect to the metric tensor. 


Combining the equations (60.03) and (60.19) we can conclude that the 
variation of the expression 


I 
= (60.20) 
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with respect to the components of the metric tensor is 


OPP [a c? 
8,W = | [7 ue — (Rev — $gR)/89,./(— g):(dz) (60.21) 


This variation vanishes by virtue of the gravitational equations (60.01) and these 
equations in turn may be obtained from the variational principle 3W = 0, if 
the variation is with respect to the components of the metric tensor considered 
as independent. (We recall that we have already discussed the variation of 
the action integral with. respect to the g,, in Section 48, but that there the 
89,,, Were not completely arbitrary because they corresponded to an infinitesimal 
change of the coordinates and were expressible in terms of four functions 4,,.) 

If W is varied with respect to the other functions that enter the action 
integral S we obtain the equations of motion and field equations for these 
functions. 

Thus the equations (60.01), the field equations of gravitation, are now unified 
with the other field equations (for the velocity field of matter, the electro- 
magnetic field, etc.) in one general variational principle. 

The variational principle may be given a somewhat different form by taking, 
in place of the invariant integral 


I= | Ra/(— g):(dz) (60.04) 
another integral which is not invariant, but which does not contain any second 


derivatives. 
In Appendix D we derive the relation 


RoHS SE (60.22) 


where [J is the d’Alembert operator acting on y= log (— g), see (D.51), I is 
given by (D.43) and the quantity may, according to (D.54), be written as 


L = g(V8,0%, — 020%.) (60.23) 


nv af 


as well as in many other forms given in Appendix D. 
Using the definitions (D.59) and (D.61) one can also replace (60.22) by 


1 7) 


Se R= GPP EL (60.24) 
R eo) in, (V' 9) Y )} 
The integral J will differ from the expression 
[* = — i) Ln/(— g): (dz) (60.25) 
by the quantity 
r= | 2-9 —Py@ (60.26) 


which reduces to a surface integral and has a vanishing variation. Therefore 
the variations of J and J* are equal : 

sl =8 i Ry/(— g):(dz) = —8 | Ly/(—g):(dz) = 81* (60.27) 
Since in a variational principle only the variations of the integrals, not the 
integrals themselves, are of importance, we can replace I by Z* in (60.20). 
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The quantity 81* (which is equal to 82) is independent of the coordinate system 
in spite of the fact that I* itself may depend on the coordinate system. The 
purpose of replacing I by I* is the removal of second derivatives from the 
integrand. The relation 


8 [ Ly(—9)-(dz) =f (Re — 49° R) BgopV(—9)-(dz) (60.28) 


which follows from (60.19) and (60.27) can naturally also be derived directly, 
though the necessary calculations are somewhat involved. 


61. On the Local Equivalence of Fields of Acceleration and of Gravitation 


As we have seen, the law of equality of inertial and gravitational mass, which 
is a rational dynamic generalization of Galileo’s law (Section 50), forms an 
intuitively satisfactory basis for the theory of gravitation. We therefore used it 
as the foundation of our argument which, after some generalizing assumptions, 
led us to Einstein’s gravitational equations (Sections 51 and 52). When Einstein 
himself derived his gravitational equations, he argued rather differently. In 
contrast to our treatment based on the dynamic formulation of Galileo’s law 
Einstein took as his basis a certain kinematical consequence of that law, which 
he called the “ Principle of Equivalence ”’. 

In the theory of gravitation the Principle of Equivalence is understood to be 
the statement that in some sense a field of acceleration is equivalent to a 
gravitational field. The equivalence amounts to the following. By introducing 
@ suitable system of coordinates (which is usually interpreted as an accelerated 
frame of reference) one can so transform the equations of motion of a mass 
point in a gravitational field that in this new system they will have the appear- 
ance of equations of motion of a freé mass point. Thus a gravitational field can, 
so to speak, be replaced, or rather imitated, by a field of acceleration. Owing 
to the equality of inertial and gravitational mass such a transformation is the 
same for any value of the mass of the particle. But it will succeed in its purpose 
only in an infinitesimal region of space, i.e. it will be strictly local. 

In the general case the transformation described corresponds mathematically 
to passing to a locally geodesic system of coordinates (see Section 42). As was 
shown by Fermi, it is possible to introduce coordinate systems which are locally 
geodesic not only at one point but also along a time-like world line (see [14}). 

Thus the principle of equivalence is related to the law of equality of inertial 
and gravitational mass, but is not identical with it. The latter is of a general, 
non-local character while the equivalence of a field of acceleration and a field 
of gravitation exists only locally, i.e. it refers only to a single point in space 


(more precisely to a spatial neighbourhood of the points on a world line, which 
is of the nature of a time axis) 


The Principle of Equivalence played an important role during the period 
before Einstein created his theory of gravitation. We shall now describe and 
analyse an argument given by Einstein at that time. 

Einstein illustrated his “ equivalence hypothesis” with the example of a 
laboratory inside a falling lift. All objects within such a lift appear bereft of 
their weight, they all fall together with the lift, with the same acceleration, 
so that their relative accelerations vanish even when they are not fixed to the 
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walls of the lift. We have, according to Einstein, two frames of reference, 
one inertial, or almost inertial, fixed to the Earth and another accelerated, 
fixed to the lift. In the first, inertial frame, there exists a gravitational field— 
in the second, accelerated frame, it is absent. Thus, according to Einstein, an 
acceleration can replace gravitation or at least a uniform field of gravitation. 
Einstein develops this idea further. He proposes to consider both the accelerated 
and the unaccelerated frames to be physically completely equivalent and points 
out that from such a point of view the concepts of inertial frame and absolute 
acceleration cease to have any meaning. 

Lét us analyse this view of Einstein’s in more detail. First of all the question 
arises : What is an accelerated frame of reference and how can it be realized 
physically? In the-lift example the “ frame of reference ” was, so to speak, 
identified with a certain box, the lift cage. But we have learnt in Section 32 
that even when gravitation is not taken into account the abstraction of an 
absolutely rigid body is not acceptable; when accelerated all bodies will 
experience deformations which will be different for different bodies. Therefore 
a box or a rigid scaffolding of the kind we discussed in Section 11 when dealing 
with inertial frames are of no use as models for an accelerated frame of refer- 
ence. Thus in Einstein’s reasoning the basic concept of a frame of reference 
in accelerated motion remains undefined. This difficulty could be avoided 
only by imposing limitations on the magnitude of the acceleration and on the 
size of the region of space to be considered. For instance, one could demand 
the following : the accelerations allowed are to be so small that in the region 
of space considered the deformations resulting from them may be neglected 
and the notion of a ngid body may be used. In that case the approximate 
nature of Einstein’s argument becomes obvious. 

Further, Einstein himself stresses that not every gravitational field can be 
replaced by acceleration ; for this to be possible the gravitational field must be 
uniform. This also imposes limitations on the spatial dimensions of the region 
in which gravitational and accelerated fields may be approximately equivalent. 
It is, for instance, impossible to “remove” the gravitational field around the 
terrestrial globe ; to do it one would have to introduce some absurdity such as 
a frame of reference in “ accelerated contraction ”’. 

Einstein also used his Principle of Equivalence in a non-local manner but 
his attempt, in a paper published in 1911, to investigate in this way the propa- 
gation of light near a heavy body gave a deflection of a light ray of only half the 
amount resulting from his theory of gravitation (see Section 59). This is con- 
nected with the fact that the Principle of Equivalence cannot possibly lead 
to the correct form (51.11) for ds? but at best only the expression (51.10) which 
is valid for slow motion. Thus, in a non-local interpretation, the approximate 
equivalence of fields of gravitation and of acceleration is also limited. As already 
mentioned this equivalence exists only for weak uniform fields and slow motions. 

Einstein gave to his principle of equivalence a widened interpretation by 
taking it to imply the indistinguishability of fields of gravitation and accelera- 
tion and asserting that from the point of view of this principle it is as Impermus- 
sible to speak of absolute acceleration as it is to speak of absolute velocity. 
To this Einstein related his ‘‘ General Principle of Relativity ” which we dis- 
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cussed in Section 49*; he used the latter to justify the demand that his equations 
should be generally covariant. However, to us such an extended interpretation 
seems inconsistent. The essence of the principle of equivalence may be seen in 
the fact that it allows the introduction of an appropriate locally geodesic 
(“ freely falling ”) frame of reference, by use of which a uniform Galilean space 
can be defined in the infinitesimal. However this in no way justifies conclusions 
about the equivalence or indistinguishability of fields of acceleration and of gravi- 
tation in finite regions of space. To illustrate the nature of the error committed 
in drawing such conclusions let us examine a mathematical example, which 
incidentally has a direct bearing on the essence of the present question. All 
functions that have bounded second derivatives behave as linear functions in 
the infinitesimal. However, this by no means allows one to conclude that all 
such functions are indistinguishable in a finite region. But an analogous con- 
clusion, namely that fields of acceleration and of gravitation are completely 
indistinguishable, was drawn by Einstein, on the basis of their local equivalence 
alone. 

Such a conclusion even contradicts Eimstein’s theory of gravitation itself. 
Indeed, if full equivalence between fields of acceleration and of gravitation did 
exist, a theory built on the idea of equivalence would be purely kinematical, 
which is by no means the case for Kinstein’s theory of gravitation. As regards 
the “ General Principle of Relativity ’, we have already pomted out in Section 
49* that such a physical principle is impossible, and also unnecessary as a basis 
for the requirement of general covariance, which is the purely logical require- 
ment of consistency for a theory in which the coordinate system is not fixed. 

Thus, although the principle of equivalence holds in a narrow sense (approxi- 
mately and locally) it does not hold in a wider sense. Although the effects of 
acceleration and of gravitation may be indistinguishable “‘ in the small ’’, i.e. 
locally, they are undoubtedly distinguishable ‘“‘in the large’, i.e. when the 
boundary conditions to be imposed on gravitational fields, are taken into account. 
The gravitational potential that is obtained if a uniformly accelerated frame 
of reference is introduced is a linear function of the coordinates and therefore 
does not satisfy the conditions at infinity, whcre it should tend to’ zero. In a 
rotating frame of reference the potential of the centrifugal force increases with 
the square of the distance from the axis of rotation, and in addition there are 
Coriolis forces. By these characteristics it is possible to detect immediately 
that the “ gravitational field ” in such frames of reference is fictitious. 

We shall now discuss the example of a uniformly accelerated frame of 
reference in somewhat greater detail, taking the theory of relativity into 
account. In doing this we set aside the question of how an accelerated frame 
might be realized and interpret the term “‘ frame of reference” more formally 
in the sense of “ coordinate system ”’. In this sense passing to a frame moving 
with acceleration will mean subjecting the coordinates to a transformation 
which contains time non-linearly. 

We assume that a true gravitational field is not present so that the square 
of the infinitesimal interval has the form 


ds? = ¢? dt’? — (da’? + dy’? + dz’2) (61.01) 
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where x’, y’ and z’ are cartesian coordinates and ¢’ the time in some inertial 
frame of reference, We perform the coordinate transformation} 


_— 2 t 
z= nent + eS (cosh — i 
Cc g c 
y =Y;3 2% (61.02) 
gt 


c 


f =2 sinh ® + 2 sinh 
; g coe 
where g is a constant of the dimensions of acceleration. Under the condition 
t 
z ral (61.03) 
the previous equations may be written as 
v=2+ igt?; y =Y; ee v=t (61.04) 
Tnserting (61.02) into (61.01) we obtain 


2 : 
ds? = (< oa *) dt? — (da® + dy? + dz?) (61.05) 


The question arises: can this expression be interpreted as the square of the 
interval in some inertial frame of reference in which there is a gravitational 
field? The answer to this question is also an answer to the question whether, 
and to what extent, the Principle of Equivalence is correct. 

To find the answer we compare (61.05) with the approximate expression 
given by the theory of gravitation. 


ds? = (c? — 2U) dt? — (1 4 =) (da? + dy? + dz?) (61.06) 


where U is the Newtonian potential of a true gravitational field. 

Under the condition 

|gx| <c? (61.07) 

the coefficients of dt? are approximately equal if we take a gravitational potential 
given by 

‘ U = —gz (61.08) 
As for the coefficient of the spatial part of ds?, it will not differ significantly 
from unity for intervals for which the quantity 


2 /dy\2 [dz\2 
we. (=) 4 (2) 4 (=) (61.09) 
v <e? (61.10) 


The value (61.08) for the gravitational potential does indeed lead to uniformly 
accelerated motion in Newtonian. mechanics. For vanishing initial velocity 
we have constant values of 2’, y’ and 2’ and approximately 


satisfies the inequality 


x + 4gt? = const. (61.11) 
{ This transformation was given by Moller [20]. 
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which describes uniformly accelerated motion in the coordinates (2, ¢). 

We have made a comparison between two expressions for the square of the 
interval which has shown that a frame of reference in accelerated motion in 
the absence of gravitation dues indeed show an analogy with an inertial frame 
in the presence of gravitation. However, the same comparison indicates that the 
analogy is far from complete, so that there can be no question of full equivalence 
or indistinguishability of inertial and gravitational fields. This becomes par- 
ticularly clear if one considers the expression (61.05) “in the large”’, i.e. 
throughout the whole of space. In the first place the coefficient of dt? does not 
saitsfy the boundary conditions, since it tends to infinity with z, in the second 
place that coefficient and with it the speed of light, become zero on the surface 
z= — /g; this is admissible. 

An even more obvious violation of the boundary conditions for the metric 
tensor occurs if the transformation (35.47) is used. In Newtonian mechanics it 
has the significance of introducing a rotating coordinate system. This trans- 
formation leads to the expression (35.48) for ds®. Here the metric tensor not 
only fails to satisfy the boundary conditions but, at large distances from the 
axis of rotation, also violates the inequalities established in Section 35. The 
impossibility of interpreting the metric tensor in (35.48) as some gravitational 
field (i.e. in the spirit of the ‘“ equivalence hypothesis ’’) is clear even from a 
local point of view, owing to the presence of Coriolis forces. 

The example just discussed confirms completely the conclusion stated above 
that the “ equivalence ” between acceleration and gravitation exists only in a 
limited region of space, and only for weak and uniform fields and slow motions 
(equation (61.08) together with the inequalities (61.07) and (61.10)). But if 
one considers the whole of space, true gravitational fields can be distinguished 
from fictitious ones caused by acceleration. In Newtonian theory this can be 
done by using the boundary condition for the Newtonian potential. In Ein- 
stein’s theory the question of distinguishing true from fictitious gravitational 
tields is most simply solved if harmonic coordinates are used. Then the com- 
ponents of the metric tensor must satisfy both the harmonic conditions (53.13) 
and the boundary conditions discussed in Section 54. As will be shown in Section 
93, harmonic coordinates can be defined uniquely apart from a Lorentz trans- 
formation. Arbitrary coordinate transformations by which fictitious gravita- 
tional fields are introduced, violate the harmonic conditions and the boundary 
conditions. Therefore one can take it that the introduction of harmonic co- 
ordinates excludes all fictitious gravitational fields. Thus, if one assumes the 
quadratic form (61.05) to be given, the passage to harmonic coordinates will 
consist in the transformation (61.02), accompanied possibly by.a Lorentz 
transformation. As the result of such a transition we come back to the quadratic 
form (61.01), the form of which indicates the absence of true gravitational fields. 

In the discussion of this section we did not use general tensor analysis. 
Its application to (61.05) would have shown that the fourth rank curvature 
tensor vanishes and that, therefore, true gravitational fields are indeed absent. 

Let us Teturn to the question of utilizing the principle of equivalence to derive 
the gravitational equations. We have made it clear that it is inconsistent to 
interpret this principle in a wider sense as a “‘ General Principle of Relativity ”. 
But this does not exclude the use of the principle of equivalence in a more re- 
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stricted sense, within the limits in which it is valid approximately. In particular 
the analogy we have discussed between an accelerated frame of reference in the 
absence of a gravitational feld and an inertial frame in the presence of such a 
field may prove helpful, because the possibility of transforming the expression 
(61.01) into the form (61.05) gives us an indication of the fact that the Newtonian 
potential must enter the theory precisely as the coefficient of di?. However, an 
approach based on this idea to the formulation of a gravitational theory seems 
to us to be unsatisfactory because of its inherent limitations (viz. the local 
nature of the principle of equivalence and the assumption that the field is 
uniform). Another disadvantage of this approach is the necessity of using the 
ill-defined concept of a frame of reference in accelerated motion. Our approach 
is free from these disadvantages, being based on the direct application of the 
law of equality of inertial and gravitational masses. It is well to remember 
that in the derivation of Einstein’s gravitational equations we made no use at all 
of any frame of reference in accelerated motion and therefore also no use of the 
principle of equivalence. As for this latter principle, to the extent that it is 
valid it may be obtained subsequently as a consequence of the other assump- 
tions. Thus it is implied by the hypothesis that space-time has Riemannian 
character, its mathematical expression being the possibility of introducing a 
locally geodesic coordinate system along a time-like world line. 

We stressed the approximate character of the principle of equivalence. But 
from the point of view of Einstein’s theory of gravitation the law of equality 
of inertial and gravitational mass also is of approximate character, since the very 
concepts of mertial and gravitational mass are approximate. These concepts 
are applicable to the extent to which Newton’s laws of motion and law of 
gravitation are valid and to the extent that it is possible to define any mass as 
a quantity characterizing a particular body independently of its position and 
of the motion of other bodies. In Einstein’s theory of gravitation this is possible 
only approximately, because there the law of motion of material bodies is of a 
more complicated nature. Nevertheless, one can affirm that the law of equality 
of inertial and gravitational mass agrees fully with Eimstein’s theory of gravi- 
tation, because this law follows from the theory with as much precision as can 
in general be given to its formulation. 

On the other hand Einstein’s theory of gravitation does not reduce to a for- 
mulation of the law of equality of the masses; it embraces essential new physical 
principles. There are two such principles. The first is already contained in the 
ordinary theory of relativity: the unification of space and time into a single 
four-dimensional manifold with an indefinite metric. This principle is related 
to the limiting nature of the velocity of light and, closely connected with this, 
to the more precise specification of what is meant by a sequence of events in 
time and also by cause and effect (Section 12). The second principle establishes 
the unity of metric and gravitation; it is the very essence of Einstein’s gravi- 
tational. theory. 

It is just these two principles, and not any widening of the concept of rela- 
tivity, supposedly possible as a result of the local equivalence of acceleration 
and gravitation, which form the basis of Einstein’s theory of gravitation. 
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62. On the Clock Paradox 

To conclude this chapter we consider the so-called clock paradox. We dwell 
on it not because it is an especially important or abstruse question but because 
the paradox has been very widely discussed in the literature and some not 
wholly satisfactory resolutions have been given. 

The paradox arises from using incorrectly the concept of relative motion and 
from ignoring the distinction between inertial and non-inertial frames of 
reference. It consists of the following. 

We imagine a clock A to be at rest in some inertial frame of reference. Let 
a clock B move past A with a constant velocity v and then, after it has covered 
a certain distance, let it be subjected to a negative acceleration which reverses 
its speed so that it again passes A, this time with velocity —v. At the two 
instants at which B is travelling past A the readings of the two clocks can be 
compared directly without the intermediary of light signals. Such a com- 
parison will show that clock B is running slow compared to A, or at least this is 
the result of applying the expression for proper time given in Section 14. 

Now, motion is relative. Therefore clock B could also be thought of as at 
rest. Then the other clock, A, must be looked upon as first receding from B 
with uniform speed and then approaching again so that the same equations of 
Section 14 appear to predict also that A should be slow compared to B, in 
contradiction with the previous result. 

The difference in the readings of two clocks situated at the same point in 
space is an absolute and objective fact. It depends neither on the frame of 
reference nor on the method of description. Therefore, as long as they are 
correct, all methods of description must lead to the same result. The contra- 
diction obtained shows that an error was made somewhere in the discussion. 

It is not difficult to see that the error consists in failing to take into con- 
sideration the fact that in the imagined experiment the clocks A and B were 
subjected to different physical conditions. Clock A did not receive acceleration 
and experienced no impulses whereas clock B was accelerated ; it experienced 
an impulse which reversed the direction of its velocity. In other words the 
error consisted in assuming that the two frames of references connected with 
clock A and clock B respectively were to be treated as equivalent, which they 
are not ; only the frame of J is inertial. 

This is the qualitative resolution of the paradox. It has the significance that 
the equation for proper time given in Section 14 


t= { Ni; (1 Z =) (62.01) 


is valid only in an inertial frame of reference, but not in an accelerated one. 

By what should this equation be replaced in the case of an arbitrary frame 
of reference? The possibility of such paradoxical results will in any case be 
excluded if we take as our formula for proper time the invariant expression 


t 
i 
=- | Woo + Works + Jetty) at (62.02) 
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Let U, be the constant value of U at the position of A. If B passes A for the 
first time at t = 0 and then, when returning, again at t = T the difference in 
the readings of A between the passages is 


T 
U U 
1m | (1 a at)at = (1 — os)r (62.04) 
0 


Here the influence of the gravitational potential on the rate of A is taken into 
account (see equation (51.14)). For the difference of the two readings of B 
over the same length of time we get 


T 
1 
tp = | f — 5 (ge? + ua (62.05) 
0 
Therefore B will be slow compared to A by the amount 
T 
1 
ut =5 { (Av? + U — U,)dt (62.06) 
0 


where the integral must be taken along the trajectory of clock B. By the law 
of conservation of energy we have 


1,,2 _ =— 142 _ 
VU U= 3 U. 


0 


(62.07) 


where ¥, is the value of the speed of B at the point where U = U,. The relation 
_(62.07) enables us to write (62.06) in various different ways, for example, as 


eee ne z[¢ (42 + 2U —2U,) dt (62.08) 


We now assume for the potential U that in the region considered it has the form 
U =U, (for x < 2) 
U=U,+ 9(2, — 2) (for x > x) 
Let clock A always be at the origin of coordinates and let B move along the 
z-axis. The x coordinate of B will then be 


(62.09) 


L = Vol (for t < t,) (62.10) 
x= 2, + v(t — t,) — dg(t —t,)? (for t, <t < ¢,) (62.11) 
and X= XL, — v(t — f,) (for t > ¢,) (62.12) 


Here ¢, and ¢, are the times at which B passes through the point + = z, on its 
motion to and fro. These times are 


Ly 29 
i= t=t +— =t,+¢@ (62.13) 
Vo g 
2 
where a (62.14) 
g 


is the duration of the uniformly accelerated motion. The time of return of 
B to x = Ois, by (62.12) 


x 
Pot =hth (62.15) 
0 
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In calculating the integral it is most convenient to use (62.08) because in it the 
quantity $v? is constant and the difference 2U — 2U, differs from zero only 
in the region t, <¢ <t, where (62.11) is valid. An elementary calculation 
using (62.14) gives 


=a (EE — 3¢*) (62.16) 


On the other hand, if we had unjustifiably used equation (62.01) for the 
accelerated motion defined by (62.10) and (62.12) we would have obtained 
(62.16) with a factor } (instead of 2) in the ¢* term in brackets. Thus in the 
factor multiplying vp?/c?_ there is only the term 47 if acceleration is neglected; 
the terms $7 — 4£* give the kinematical effect including acceleration, and the 
full physical effect (kinematical and gravitational) is given by the terms 
47 — &0*, as in formula (62.16). 

It follows from (62.16) that if the duration of the accelerated motion is ?7, the 
clock B is not observed to be slow at all, and for ¢* = T it is even fast. It should 
incidentally not be forgotten that (62.16) is not general but only derived under 
rather special assumptions concerning the nature of the motion. 

We have performed the calculation in the inertial frame of reference. To 
repeat it in a frame connected with clock B would have no sense for we should 
only be evaluating the same integrals in terms of different variables. 

In conclusion let us note that in clarifying the clock paradox we deliberately 
avoided using the principle of equivalence. We did so because the approximate 
nature of that principle might give reason to doubt whether any resolution of 
the paradox based on it is exhaustive. It was our intention to avoid all such 
doubts. 


CHAPTER VI 


THE LAW OF GRAVITATION AND 
THE LAWS OF MOTION 


63. The Equations of Free Motion for a Mass Point and their Connection 
with the Gravitational Equations 
In the preceding chapter we have already made use of the assumption that 
in a given gravitational field a mass point moves along a geodesic. However, 
this assumption does not represent an independent hypothesis but may be 
considered to be a consequence of the gravitational equations together with 
an assumption concerning the form of the mass tensor. To obtain this result 
the gravitational equations are required only in so far as they lead to the 
relation 
v, T= (63.01) 
which expresses the divergence-free character of the mass tensor. We can 
obtain the equations of motion for a mass point from those of a continuous 
medium by going to the limit of a concentrated mass. We can take the mass 
tensor 


1 
fe a o*ubuY (63.02) 


where, as in Section 48, p* is the invariant mass density and u’ the four- 

dimensional velocity. These quantities are related by the equation of continuity 

Vi(p*u’) = 0 (63.03) 

We shall now give two derivations for the equations of motion of a mass 

point : the first will be based on a variational principle, the second on a direct 
application of (63.01). 


In Section 47 we established a relation according to which the variation of the 
action integral 


S = | ctp*/(—g)- (da) (63.04) 

was equal to 
3 | ctp*y/(—9)- (da) = — { p*(uV,uJEV(—g) (dz) (63.05) 
where &° was an infinitesimal displacement vector. (Equation (63.04) differs 
from (47.46) only in its use of the symbol p* for the invariant density.) On the 


other hand, if the same action integral is varied with respect to the g,,, then, 
by (48.03), the result is : 


3,[ ce*/(—g)-(de) = } { ptwtw8g,/(—g)-(dx) (63.06) 


According to the general relation (48.22) this shows that the action integral 
(63.04) really corresponds to the mass tensor (63.02) 
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In the action integral (63.04) we can go to the limit in which the invariant 
mass density p* differs from zero only in the neighbourhood of a single point 
in space, the volume integral of the density over a volume surrounding that 
point being finite. 

The equation of contimuity (63.03) may be written in the form 


ie (V(-9)- ate} = (63.07) 


Multiplying this expression by da, dz, dz3, integrating over the volume des- 
cribed and using the fact that p* vanishes at the boundaries of the volume, 
we obtain 


d 
di | e*u/(— 9) dx, dz, dz, = 0 (63.08) 


(we have written ¢ for x, to emphasize the meaning of this relation). Conse- 
quently the value of the integral 
| e*u%/(— g)-dx, dx, dz, = mc (63.09) 
is a constant, independent of time. But the quantity o* differs from zero only 
around one point. Therefore the factor u° may be given its value at that point 
and taken outside the integral. Its value is 
dt c 
Fae ae ee (63.10) 
dt V (Goo i 29k; = Git Ly) 
where the 2; are the spatial coordinates of the mass pomt and the 2; their 
derivatives with respect to the time. Using this value of u° we get 


| o*s/(— 9) da, dary dry = M/(Joq + 29, + IX %,) (63.11) 


The action integral can be obtained from this expression by multiplying by 
c’dt and integrating with respect to time. Therefore 


t 
S = me? [ (Goo + Gores + Justis) at = me? fac (63.12) 
7(0) 
This expression differs only by a constant factor from the integral considered 
in Section 38, the variation of which gives the equation of the geodesic. Thus 
the variational principle leads to equations of motion for a mass point 


8S = 0 (63.13) 


which coincide with the equations of a geodesic. 
We now derive these equations in another way, starting with the fact that 


the divergence of the mass tensor vanishes. 
Equation (63.01) may be stated in more detail as 


Sey 9 P+ & V4) TY + V(— 9) Tigh = 0 (68:14 


Multiplying it by dz, dz, dz, and integrating over a volume at the boundaries 
of which the tensor 7“ vanishes, we get 


‘ | V(— 9): dey dir, dry + | V(—g)-Pug8% dex, diz, dx, = 0 (63.15) 
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Here we can insert the expressions (63.02) for the components of the mass 
tensor. If the previous equation is multiplied by c? it then assumes the form 
| o*uou’ 4/(— 9) dx, dx, dx, + [ Petey 9): dz, dz, dz, = 0 
6 

(63.16) 


The integrals involved can be calculated by the same means as the integral in 
(63.11). Using (63.09) we obtain 


| o* uu 4/(— g)-dar, dx, day = meu" (63.17) 
me 
and | Ty,p*urul 4/(— 9) da, da, dry = aa Tyuau® (63.18) 


where all quantities are to be taken at the position of the mass point. Replacing 
dt by wdt and dividing by the common factor mc/u° we obtain from (63.16) 


du’ 
k +TPygu*u® = 0 (63.19) 
dz. da dix, dz. 
gia 2 8 _ Q- eM aseggv 63.20 
7 dz? a Us dt dt : dt i ( ) 


These are the equations of a geodesic in explicit form. We have thus again 
shown that the equations describing the free motion of a mass point are identical 
with those of a geodesic. Our derivation shows that these equations can be 
obtained directly from the equation VT” = 0 by integrating over volume 
and subsequently passing to the limit of a concentrated mass. 

Here it is important to note the following. The operation of averaging over 
the volume is generally speaking not unique: it depends on a weight function. 
If before integrating over the volume we had introduced a weight function A 
we could have obtained a result dependent on >. However in the present case 
of a mass tensor of the form (63.02) the factor A cancels out in the limit of a 
concentratcd mass, and unique equations of motion are obtained. 

We note that if the ordinary time t = 2, is taken as the independent variable 
instead of the proper time + and if the previous equations are multiplied by 
(dz/dt)? they appear as 

dz, da, dx, dx, dx, dx, 


eae aa ta a” wee 
The equation for v = 0 reduces to an identity. 
The quantity 
vo du’ 448 
a + Ty utu (63.22) 


which appears on the left-hand side of the equations of motion is the contra- 
variant acceleration vector of the particle, as it was given by (46.26). 

Let us now clarify the precise meaning of this acceleration. If no gravitational 
field is present the quantity w’ is the ordinary acceleration; in Galilean co- 
ordinates the spatial components of this vector tend in non-relativistic approxi- 
mation to the second derivatives, with respect to time, of the cartesian co- 
ordinates of the particle. When a gravitational field is present this will not be 
so. To determine the non-relativistic limit of the w’ in this case one can compute 
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the approximate expressions for the Christoffel symbols P3, which follow from 
the expressions for the metric tensor derived in Section 55. It is convenient 
to postpone these calculations until Section 65, and bere we only quote their 
result. Taking first the Ty, with zero superfix we find that 

1 aU 10U 
—_—_— —* [To —— ee 

c® at’ a c? Oxy (63-29) 
while the others are small. Among the I With a spatial superfix the most 
important are aU 


a = (63.24) 


Further, we require approximate expressions for the four-dimensional velocity. 
Denoting the ordinary velocity by 


ia. 


st 63.25 
aE Sa 
we can put, approximately 
dx; 
i'n », (63.26) 
dz 
dt 1 
and us — ae ~14-—(§v? + U) (63.27) 
T Ce 
Insertion of these expressions into (63.22) gives 
dv, au 
pa ee 63.2. 
~ dt ou; ( ) 
1 dep | 
and o~ — | — — — 63.29 
ae c : ( dt CxE ( ) 


(Here summation over k from 1 to 3 is implied.) 

These equations show that the spatial] components of the vector w’ represent 
the acceleration of the particle apart from the acceleration due to gravity, while 
the zero component is proportional to the work done on the particle in unit 
time by all forces except gravity. 


64. General Statement of the Problem of the Motion of a System of 
Masses 


The problem that will now concern us has already been generally characterized 
in Section 54. We consider a problem of an astronomical type, relating to the 
motion of celestial bodies in empty space. It is known from astronomical 
observations that the distribution of mass over the universe is far from wniform: 
an overwhelmingly large part 1s concentrated in the form of separate celestial 
bodies at great distances from each other. We shall assume accordingly that 
the components of the mass tensor vanish throughout space, except in some 
separated regions whose dimensions are small compared to their mutual 
distances ; each such region will correspond to a celestial body. 

Within each body the mass‘tensor must in the first place conform to whatever 
physical model we accept for the body (that of a gas, a liquid or an elastic solid, 
étc.), and it must further satisfy the requirement that its divergence vanishes 

V,TH = 0 (64.01) 
To achieve conformity with some physical model one must express the com- 
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ponents of the mass tensor in a definite manner in terms of functions of the 
state of the physical system that makes up the body. (Such functions of state 
are density, velocity, pressure, etc., see for instance (55.02).) 

The form of the mass tensor will depend not only on the physical properties 
of the body in question but also on the metric. Therefore its components will 
involve not only the state functions in the narrower sense but also the mctric 
tensor. The metric tensor and its derivatives will also enter into the divergence 
expression (64.01). 

Thus, if one wishes to write down the mass tensor one must know the metric. 
But the metric has to be determined from Einstein’s equations which have the 
mass tensor on their right-hand sides. It is therefore evident that only a joint 
determination of the mass tensor and the metric as functions of the coordinates 
can be possible. 

Let us compare the problem we are considering with those encountered in 
Newton’s theory of gravitation. There the most common problem is to deter- 
mine the gravitational potential from a mass density that is assumed to be 
known. But there are other more complicated problems in Newtonian theory 
in which the potential must be determined simultaneously with the density. 
A famous problem of this kind is that of the shape of a mass of liquid in 
rotation. This problem was studied by Liapunov in Russia and by Jeans and 
Poincaré elsewhere. Its rigorous solution was given in Liapunov’s posthumous 
work [21, 22]. The problem we are concerned with is reminiscent in its general 
character of Liapunov’s problem, but in our case one has to find two tensors, 
the metric tensor and the mass tensor, instead of two scalars, the Newtonian 
potential and the density. In addition, we must deal not with equilibrium but 
with motion. It may be noted that in Section 73 we shall also encounter the 
basic equations of Liapunov’s problem. 

Our problem is simplified in the first place by the fact that the metric nowhere 
deviates greatly from the Euclidean ; the table given in Section 58 gives an 
idea of how small the deviation is. A further simplifying circumstance is 
that at all significant distances from the bodies, the metric does not depend 
on the detailed internal structure of the latter, but only on certain overall 
characteristics. Such characteristics are the total mass of the body, its moments 
of inertia, the position and velocity of its mass centre and so on. The Newtonian 
potential of a body depends on thcse same quantities. 

. To solve Einstein’s equations we shall use a method of approximation which 
is a development of the method of calculation employed in Section 55. It is 
based on an expansion of all required functions in inverse powers of the speed 
of ight. An expansion that can formally be so described will, in fact, be an 
expansion in powers of certain dimensionless quantitics, such as U/c? and 
v?/c? where U is the Newtonian potential and »? the square of some velocity, 
say the velocity of one of the bodies. For systems to which the virial theorem 
applies the two quantities U and v? will be of the same order, of order q® say, 
where g is @ parameter of the dimensions of aspeed. (We have already used this 
parameter in Sections 55 and 58.) Then the quantity g?/c? can be used as the 
dimensionless parameter in terms of which the expansion proceeds. 
ea BA en has equations by introducing corrections for retardation we 
y that the dimensions of the system are small compared to the wavelength 
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of the waves emitted, which in this case are gravitational waves. This assunip- 
tion is not independent of those made above. For if the angular velocity of the 
revolution of a planet is denoted by and the radius of its orbit by R we can 
write the condition for the smallness of the system as R < c/w since c/w is the 
wave length of the gravitational wave divided by 2x. On the other hand, the 
condition that the speed v = Rw of the planet shall be small compared to the 
speed of light has the form Rw < ¢, which is the same as the previous inequality. 

In astronomical problems we are interested in “ quasistationary ” states of 
the gravitational field, i.e. states which are established after many planetary 
revolutions. The solution of the wave equation obtained by introducing cor- 
rections for retardation satisfies the conditions for a quasistationary state. 

We have already remarked that in most astronomical problems the distances 
separating cclestial bodies are extremely large compared to their linear dimen- 
sions. If R is a length characterizing the order of the separations and LD a 
length characterizing the linear dimensions of the bodies we have the inequality 


L<«€R (64.02) 


The use of this inequality introduces considerable simplifications into the 
calculations of the gravitational potential and of the metric tensor outside the 
masses : aS was mentioned, these quantities become independent of the detailed 
internal structure of the bodies. Therefore, we shall also use the inequality 
(64.02) though it is less essential than the inequalities v? < c? and U < c? on 
which our method of solving Einstem’s equations is based. 

We note that on the surfaces of the bodies and in their interiors, where the 
gravitational potential is greatest, we shall have, in orders of magnitude, 


Ue? = afL (64.03) 


where « is the gravitational radius of the body (see the table in Section 58). 
Therefore the two inequalities to be used can be written as 


a<LéR (64.04) 


Our problem consists in determining the metric tensor and the mass tensor. 
If we know the components of the mass tensor as functions of the space and 
time coordinates we also know the motion of the masses, for the masses are 
just the regions in which the mass tensor differs from zero. It is essential to 
note that the motion of these regions cannot be prescribed in advance ; the 
laws of motion are a consequence of the laws of gravitation. 

The solution to our problem gives us, firstly, some approximate expressions 
for the mass tensor and the metric tensor which involve certain unknown 
functions ; secondly, it gives us equations for these unknown functions from 
which they can be determined, given certain initial conditions. These are the 
equations of motion. 

In the course of solving the problem it becomes apparent how to choose 
these unknown functions. The procedure of solution leads in a natural way to 
just the quantities that are used in Newtonian mechanics, such as the mass 
centre coordinates of each body, its. total mass, its angular momentum, Its 
moments of inertia and all the other quantities characterizing it. 

For all these quantities equations of motion are obtained which, to first 
approximation, coincide with those of Newtonian theory and to second approxi- 
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mation deviate from them by small corrections. We shall be interested both 
in obtaining these corrections and in expressing the metric tensor and other 
quantities of Hinstein’s theory in terms of Newtonian quantities. 


65. The Divergence of the Mass Tensor in Second Approximation 
We shall attack the problem of determining the mass tensor and the metric 
tensor in a series of steps, starting from the discussion of Section 55. We recall 
that discussion. In the initial approximation the metric is taken to be Euclidean, 
which corresponds to complete neglect of the forces of gravity. In that approxi- 
mation it was possible to prescribe only the components T° and T% of the mass 
tensor. According to equation (55.03) these components appear in Galilean 
coordinates as 
Lae Se 5 Q; ite 5 ov; (65.01) 

where ¢ is the density and ; the velocity of the ponderable matter at any point, . 
with 

Gp | \ 4 (eu) 


=0 02 
a ar vo 


The spatial components 7** were not determined in this approximation ; 
it was only assumed that their order of magnitude is the same as in the absence 
of gravity. 

These assumptions concerning the mass tensor proved to be sufficient to 
determine the first approximation for the metric. According to (55.31) we have 


Joo = 0 — 20 
4 
Io = oe Ui (65.03) 


where U is the Newtonian potential of gravitation satisfying the equation 


AU = —4ryo (65.04) 
while U; is the vector potential of gravitation satisfying 
AU; = — Anyeu (65.05) 


It must be remembered that the potentials U and U i are related to the mass 
density e and the mass current ev; in a non-local manner; the value of the 
potentials within a given body depends on the distribution of density throughout 
space and not only within the body itself. 

In Section 55 we also gave equations for the contravariant components of 


the metric tensor, namely 
1 2 
gre A(t 5 2) 
c 


ce 


_ 4 
=U (65.06) 
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We had also, according to (55.37), 


Knowing the metric to the approximation of these equations we can take the 
next step in the construction of the mass tensor. To this end we must first 
write down more precisely an expression for its divergence. This is the aim of 
the present section. 


According to (41.24) the general expression for the divergence of a symmetric 
tensor has the form 


oT 
Ve = aa ae eae (65.08) 
where for brevity we have put ; 
a 
y= 5 flog V(— 9] = (65.09) 


In any case the order of magnitude of the components of the mass tensor is 
given correctly by the equations (55.02). We have 


¢) P 
Too _ o() : Tot off i) : Tik — o(& ') (65.10) 


where g is the parameter used before which characterizes the order of magnitude 
of the velocity. 

Let us consider the zero component of the divergence of the mass tensor. 
It is given by (65.08) with » =0. In our initial approximation we neglected 
all but the derivative term in this equation. To take the next step we must 
know the quantities [, and y up to terms of the order of 1/c? inclusive. As 
for the spatial components of the divergence, they were neglected completely 
in the initial approximation. To take them into account in first approximation 
we must now find the leading term of IY, which does not contain a factor 1/c?. 
For the determination of y,, [, and I’, to the required accuracy the approxi- 
mation represented by (65.03) to (65.07) is sufficient. 

From (65.07) and (65.09) we readily obtain 


2 aU 2 aU 
=: ee eee 65.11 
Yo ot a? C8 Bary ( ) 
If we then calculate the Christoffel symbols from the well known relations 
Pe, = 9D, aa (65.12) 
re, yy 29 
wher Toe =3 +S - 65.13 
. wee (Fe " On, On, 83:72) 


it is easy to see that in the last equation our approximation allows us to neglect 
all terms containing the factor 1/c?. There then remains 


oU oU 
| (ee ee No = —-s (65.14) 


and also 
etal (65.15) 
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Hence we get for 19, : 


Re a Os, Seite (65.16) 
00 c2 ot 2 04 c2 Ox; 
whereas the T°, will be of higher order of smallness, namely 
re, = o(- (65.17) 
ik ~ “\ oa , 
Among the Christoffel symbols with a spatial superfix there occurs 
aU 
= 65.18 
Me = - > (65.18) 


while all the others will be small : 


1 1 
r= o( 5) ; t= o(5) (65.19) 


Having found the values of the Christoffel symbols we can now write down the 
required expression for the divergence of the mass tensor. 
Equation (65.08) with » = 0 may be stated in more detail as 


aT age : 
Vm = + S— + (By + ye) T° + (PG, + y,) TH +18, TH 
4 
(65.20) 
By (65.11) and (65.16) the coefficient of T° in the above is 
1 aU 
re = —— 2 
b+n-45 (65.21) 
As for the coefficient of T°, it proves to be negligibly small, namely 
1 
200, + y, = o(5) (65.22) 


By (65.17) the coefficient of T** is of the same order. 
Thus in our approximation equation (65.20) assumes the form 


a Sa aes 
a oe 
For the spatial components of the divergence, equation (65.08) with » =7 
gives 


VT" = 


éT® = aT ik 


Vi TY = ae fe aa + T4700 4 Qt Tok + yo T% 4 Tt TH 4 yp Tee 


(65.24) 
As may be seen from the estimates discussed above, the coefficients of the last 
four terms on the right-hand side will all have a factor of 1/c? so that these terms 
may be discarded. Expressing I4, by (65.18) we therefore get 


oT =6aTtk =6—aU 


ae a0 
at OL, = OG < 05:25) 


VT — 


Thus, if the deviation of the metric from the Euclidean, or, in other words the 
force of gravity, is taken into account, to first order, we must express the 
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condition of vanishing divergence by the equations 
ofeo eT% = aU 
at OL,  c* At 


and 
oTto ort 4] 
GE OE NOU epi (65.27) 
ot OL’ Oxy 


66. The Approximate Form of the Mass Tensor for an Elastic Solid 
with Inclusion of the Gravitational Field 
In Section 32 we obtained approximate expressions for the mass tensor of an 
elastic solid or fluid body, neglecting gravitational forces, by studying Umov’s 
scalar and vector of energy density and current. Thcse expressions were then 
written down in Section 55 in the notation we are now using (z) = t). According 
to (55.02) they have the form 


1 
c2T = p f + 5(0" + mn} 
: I 1 
c?T% — pv; f + eae ag rn) ~ Ge Pixve (66.01) 


etTtk — CViVE — Pik 


We note that here py denotes the three-dimensional elastic stress tensor and 
II the elastic energy per unit mass of the body. According to (30.08) these 
quantities satisfy the relation 


ays, (s+ =) 
Py ~ 2Pik Oe Oy 


(66.02) 
We now have to generalize the expression for the mass tensor so as to take 
the gravitational field into account. According to Newton’s theory the negative 
of the Newtonian potential is also the potential energy of a particle of unit 
mass placed in a given gravitational field. Therefore one would expect that tho 
required expressions for the energy density and energy current may be obtained 
by adding the terms (—peU) and (—pu:U) respectively to the expressions for 
Umov’s scalar and vector in equations (30.17) and (30.18). This gives 


S = 4ov? + o(Il — U) (66.03) 

and 
Si = vi {ov + o(T] — U)} — pinvr (66.04) 
The components of the mass tensor can then be obtained from the relations 
aro is 66.05 
c =p+ 2 (66.05) 

and 


1 
oT = py + 5 Si (66.06) 
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We shall thus finally have 


] 
Cr = 5 f Seer en et Go v)| 
C 


1] 7 1 
cert — ouM% f + oe + II — v)| — oa Dik Uk (66.07) 
c2 Lik = pvyv, — Pik 


If our argument is correct, then to the approximation required these expres- 
sions must satisfy the equations derived at the end of the preceding section, 


namely 
G8 OTS ABD 
at Orr c? at : 
OT 5 TEU ies ae 
ot OLk Om 
These equations must be satisfied by virtue of the equations of motion for the 
quantities involved in the mass tensor. We have, firstly, the equation of con- 


tinuity 


eal eae (66.09) 
ot Ox4 


and, secondly, the equations of motion of an elastic body in a gravitational 
field of acceleration 0U/éa;: 


26-2" (66.10) 


In addition, the relation (66.02) between the stress tensor and the elastic 
potential energy is valid. Using (66.09), (66.10) and (66.02) we find that Umov’s 
scalar and vector are related by 


aS aS; aU 
wo oe ae (66.11) 
whence with the use of (66.09) we obtain 
oT) = aT ok 1 au 
ai ae ae (66.12) 


Since p =c?J® approximately, equation (66.12) and the first equation of 
(66.08) coincide in the approximation in question. As for the remaining 
equations of (66.08), after multiplication by c? they coincide approximately 
with the equation of motion (66.10) written in the form 


(ev) O( OV x) ODik oU 
a = Sp 0 66.13 
at r CLE CLE P On ( ) 


(here it is sufficient to take the leading terms in the expressions for 7 and i da) 
Thus the equations (66.08), which express the fact that the mass tensor 
has vanishing (non-Euclidean) divergence, are indeed approximately satisfied 
if the quantities (66.07) are taken to represent the mass tensor. 
In the equations (66.07) the density p satisfies the equation of continuity 
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(66.09). The corresponding generally covariant equation has the form 
a 
Fp (V(— 9) e*ur} = 0 (66.14) 


where p* is the invariant density and u* the four-dimensional velocity. To 
make (66.09) and (66.14) coincide we can put 


1 
e=—v(—9)-p*u° 

: (66.15) 
ou = — V(— 9): put 


(The factor 1/e has been included so as to make p* approximately equal to 9.) 
Noting that by (65.07) and (63.27) 


] 2U 1 
Pe a ae a ce w= 1+ Sv? + U) (66.16) 
we get, approximately 
1 
= oll — ane + 30) (66.17) 


The equations (66.07) allow-one to write down an expression for the invariant 
of the mass tensor. Introducing the pressure by the equation 


P = — 4(Pir + Poo + Pas) (66.18) 
1 3 
we get yi of} +5(—bts I 30}—Sp (66.19) 
: c 
and using the equation (66.17) for p* 


; i I 
Lf = 0" (1+5)- 


» (66.20) 
This last expression agrees with (32.30). 


67. Approximate Expressions for the Christoffel Symbols and Some 
Other Quantities 


In order to take the next step in the determination of the metric tensor 
we must extend the calculations of Section 55. As all our calculations will be 
(lone in harmonic coordinates it is convenient to use as unknown functions the 
contravariant components of the metric tensor multiplied by +/(— g), i.e. the 
quantities 

g* = +/(—9)-g” (67.01) 
The advantage of using these quantities comes from the fact that they permit 
the harmonic condition (55.79) to be stated in the simple form 


agt" 
S26 (67.02) 


ex, 
so that it is linear in the unknown functions. A further advantage of this 
choice of unknown functions is that the spatial components g** differ very little 
from constants. Finally it is also most convenient that the left-hand side of 
every one of the gravitational equations involves the d’Alembert operator 
applied to the appropriate component g"’, so that approximately each com- 
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ponent g"’ is connected only with the single component of the mass tensor having 


the same indices. 
In Section 55 (equations (55.38)) we found approximate expressions for 


the g’’: 


age 
8 c c3 
4 
a = 5 Us (67.03) 
g* = — ip 


Here U is the Newtonian potential and U; the gravitational vector potential. 
We now use more accurate expressions obtained by adding the next terms in 
the expansion in inverse powers of the speed of light. We have then : 


Reietin i 4U) 48 
aa ee Ban 
4U, 48 
got = =; 4 = (67.04) 
48 
git = —cdiy + a 
c 


Using these expressions we can calculate the Christoffel symbols and the 
other quantities that enter the gravitational equations. 

We begin by calculating the determinant g. As is easily verified (see equation 
(B.67)) it is equal to the determinant of the g"’. The equations (67.04) give 


| 4U = 


1+ += (67.05) 


Here we have used the summation convention for spatial indices and have put 
3 
=1 
Extracting the square root we obtain further 


2U | 28 — Sex — 202 
vi-9 =e +4 | 


ct 
We introduce a special symbol for the fourth root 


- ( = A 2% (67.08) 


U 1 


(67.07) 


By (67.06) this quantity is 


The quantity f will approximately satisfy a linear differential equation which 
will be derived in the next section. 


From (67.04) and (67.07) we obtain for c?g® the expression 


2U | 28 + 28h, — 2U? 
a era RSE (67.10) 


cf 


1 
If we put OF = Ua as (S + Sex — 2U?) (67.11) 
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we have with the same accuracy 


c? + U* 
E298 = 2 oF (67.12) 
1 c? — U* 
and also =3 Joo = aut (67.13) 
We shall encounter the quantity U* in the following. 
We now pass on to the calculation of the quantities 
1 uB lee aB 
[#28 — (0 eg + gfe eg — gi "} (67.14) 
2g ox. er, 0x, 


These are introduced in Appendix B and are related to the Christoffel symbols. 
Using (67.04) and (67.05) we obtain the following set of equations 


fen 
c® at 
2 aU 
001 == 67.15 
u c! ox) ( ) 
pon 2 (2 4 20H 
ct Ox) Ory, 
2(aUu  1/a@ aU ao)| 
(002 A A et 
ang ” c4 ee als Ox; v: ot 
2 (aU; er) 
Ae KOe OS oo 67.16 
Ht “(ze Or; ( ) 
[lt #2 — o(=] 
ct 
If we lower the second and third superfix we obtain the II%,. We find 
i 2 aU 
Too ce at 
2 aU 
y= 67.17 
Mk = 45 (67.17) 
1 
ns = (2) 
: au =1/a aU =) 
os CT) a te 
es I¢0 (= alee Ox1 “ls ct 
i= — (2 = as (67.18) 
re c2\ Or, any 


; ] 


In calculating II¢, the value go) = ce? — 2U is needed, but for the calculation 
of the other quantities it is sufficient to use the Galilean values of the metric 
tensor components. In the equations for the I, we have restricted ourselves 


to terms of the erder 1/c?. 
In order to evaluate the Christoffel symbols we require not only the Ij, 
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= 4(y,d% + y ot — ¥" Gua) (67.19) 
0 as 
where — eat yt = gy, (67.20) 


a 


(see Appendix 1). We state the approximate values ot the y,. Differentiating 


(67.05) we obtain 
ape 1 (= au ou | 


Ye ea ae aa Ba 
ColOx, = c*\ 0X4 On Ox4 67.21 
2(aU 1 /as dU OS ray) 
Y= 3A— =| —- 40 = - 
c*\ ot c*\ ot at at 


Retaining terms up to the order of 1/c* we find the following values for the y* 
with upper indices 


2(0U 1/as oU OSxk 
Y= — lan | Namo” Bae Ome 
: 67.22 
2 aU Ot) 
Uris ag 
c+ at 


The Christoffel symbols are expressible in terms of the quantities just 

derived as 
Tu, = Tk, + Ake (67.23) 
If we confine ourselves to terms of order not higher than 1/c? we find for yp = 0 
just as in the approximation (65.16) : 
1 aU 1 aU 1 

whereas for » =12 we obtain expressions that are more accurate than (65.18) 
and (65.19) : 


oU 1 (as OSkk ou ou; 
Peet terse eee ete en Bp. pba 
* Ox, =? (= Cx4 au Ox; a5 ot 
1 aU 2 (eu; OU, 
Ok c2 at tk C2 (= On; (67.25) 
1 /eU oU oU 
It, =— |— 3; — -—_ 
kl 7 (5 k + ae Se are bu 


Im approximate calculations it is convenient to use equations arranged in 
such a form that the IT,4 are involved rather than the I',4. The advantage 
of the former quantities over the latter may be seen by comparing (67.18) 
with (67.25). These equations show that the II{, may be neglected, but the 
Tt, may not. 

In Appendix 1) we obtain an equation for the Einstein tensor which has 
heen brought to the form described; this is equation (1).87). It involves the 
Lagrangian, which by (D.95) is equal to 

1 Age 
L _—— ee ee ea II’ pai ae i v 
2>/(— 4g) af ex, Ba ZY (67.26) 


We derive the approximate value of this expression. From (67.21) and (67.22) 
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we get 


4 (eu\2 4 8U\ (aU 1 AS — Sxx)\? 

Sg) = ee en es 

v(— 9) od aa) ( ao. +5 On, 
(67.27) 


On the other hand, if we use (67.17) and (67.18) in conjunction with the initial 
expressions (67.04) for the g*” we find 


Le 20? _ a2 16 aU avy 
~— eat) | C8 aay Ot 
ae (1 ea] i 2 =)'-5 (< ~ oy (67.28) 


c c2/\éda;  c* Oxy c® \ Orr 0x4, 


Adding this to one half of the preceding equation we obtain for ,/(— g) times the 
Lagrangian 


2 8U oU\2 2aU (as OSkk 
Tang) Se ies) | ah) a ee 
v9) 3( =) (=) T 3 Ox4 (= + =) | 
6 (=) 16 0U aU, 4 (= ou; 


2 
etl 67.2 
Be a aie!) 


c§ 0x, at ce 


Ox k Ox 


Introducing U* from (67.11) this expression may be written as 


2/aU*\? 6 (aU*\? 16 aU* aU, 4 (aU; aUy\* 
iva 3 (Se) tas (ae) hele a) 


ce \ dx; oc} \ at. © Oxy a OLE on 


(67.30) 
Here we have replaced U by U* in the correction terms. The last equation 
is remarkable in having on its right-hand side a homogeneous quadratic form 
in the first derivatives of the four quantities U* and U; with constant coefficients. 

By comparing (67.27) and (67.29) we see that the sum 


V(— 9) (L 2 24a") 
_ 8 fa OSxx 1 (< se) (Sy 9oU au; 
8 lay Ou, = 2 bay. “at aa; at 
will be of a higher order of smallness than (67.27) and (67.29) separately. This 
remark enables one to derive a very simple approximate expression for the 


curvature scalar F. According to the relation (1.49) we have in harmonic 
coordinates 


ee 67.31 
OL ( 


R = g a log /(—9) =f 


(67.32) 
OL, OL, 
Using the definition (67.08) we may write 
Way iP (67.33) 
whence we get 
‘ 9 9 ; 
Pa ee ae ee ee (67.34) 
f° 02,80, f?" Cx, Oa, 
But we have 
pe (67.35) 
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Therefore _2 4 Of ree ay f 67.36 
Rai a oe td TD eL20) 


We now pass to the approximate equations. According to (67.31) the quantity 
(4y,y* + L) will be of sixth order in 1/c. Discarding quantities of sixth order 
we can replace the g% by their Galilean values. The curvature scalar & then 


assumes the very simple approximate form 
2/1 af 
=f ee 67.37 
P ig (3 at? f) ( ) 
where A is the Laplace operator with Euclidean coefficients. 


68. Approximate Form of the Gravitational Equations 
The calculations of the preceding section permit us to write the left-hand 
side of Einstein’s gravitational equations 


RY — tg"R = — ea Tw (68.01) 


in the approximation corresponding to the required accuracy. Let us first 
write down the value of the left-hand side without any approximations. 
According to (D.87) we have 


1 d%quy 
Rw — $g’R es g7® g 


WaBTTY — Latav 
ig ° Oa, dn, + TTI — by4y 


+ tg"L + 4g"B — BY (68.02) 
Here L is the Lagrangian 
which was already encountered in the preceding section. According to (D.85) 
the quantities B“Y are given by the equations 
Be =P +34 ty) =U EY EO Eyl — (68.04) 


where the IY are the quantities introduced in Sections 41 and 53: 
1 ag 


Gites ee 68.05 
V0) Ba aa 

The quantity B is the “ quasi-invariant ” formed from the BY” 
B= 9, BY = (V, +y)P> (68.06) 


It is not a true invariant because the B’ do not form a tensor. 

In a harmonic system of coordinates the I’ vanish and, therefore, also the 
I’ and so do the B“ and B. 

We note that as long as the left-hand side of Einstein’s equations (68.01) 
contains the complete expression (68.02) for the conservative tensor, the 
equation 


V.Ew = 0 (68.07) 
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is a consequence of the equations themselves. However, if from the start 
we drop the terms in B“Y and B from the conservative tensor the equation 
(68.07) will only be true if the conditions TY = 0 are satisfied. 

Passing to the approximate form of Einstein’s equations we shall first consider 
the terms involving second derivatives. Using (67.04) we get 


an lap 4 op ap an 

a Bh ey ci et he i SO pas 

ax, Bary oan eT | a Tore ee =) 
(68.08) 


Here the leading terms combine to give c times the Euclidean d’Alembertian 
while the terms with variable coefficients represent corrections which may be 
neglected in the approximation considered. We consider in more detail the 
order of magnitude of these corrections. The expression (68.02) for the Einstein 
tensor involves the quantity (68.08) divided by 2g, if the corresponding com- 
ponent of g” is inserted for ». The derivatives of ¢ will be of third order and since 
g = — c* approximately, the division by 2g makes the correction terms in (68.08) 
of eighth order in 1/c. Since we intend to neglect terms of this order-we must 
evaluate all other quantities involved in (68.02) to a corresponding accuracy. 

To avoid denominators involving g we shall evaluate not the Einstein tensor 
itself but, instead, that tensor multiplied by a factor which differs little from 
unity, namely (— g/c?). Then to the required accuracy the terms involving 
second derivatives will be 

1 4, ag” 1 1 age 


— 559" = = Age — = 68.09 
90° Ge, am, Be 9 ~ Dd aH? a 


In evaluating the terms involving first derivatives we shall first use the relations 
(67.15) to (67.18) to obtain the quantities 


Deere aa) 68.10 

(- “yn Mt, = — 35 (68.10) 

pee ieee Oe a \em (68.11) 
c Bo 8 Ot Oxy 8 \ Oa, 0x4} OX 


4 8U\ (aU 1 os 2 aU,;\ (eU nimtaw) 
= (1-3) (Ga, o Oa ales ct Om * ot Ot 


aoe aS ous 


c& 


oS ) (68.12) 
OLs Ox 


Os - OLE 


In the equations (68.10) and (68.11) we need not have included the factor 
(— g/c?) because in the present approximation it may be replaced by unity. 
Further, using the expressions (67.22) for y* and introducing the notation 


Nw = (- £) a “LT, — dy4y’ + 4g} (68.13) 


256 The Theory of Space Time and Gravitation 


7 (aU \? 
we obtain NO ee (= (68.14) 
c§ \ ozs 
qn l OU ew SoU e NS (68.15) 
c&§ ot Ox c®\ On, 02x; | CLs 
2 (aU* 4 0U;\(eU* 4 aUy 
Bia hoe ee ee ec ee 
pad la ct ( da, = c*™— at )( Or, «= c®_— bt 
[Oe (“ Z sm) —15,V-97 68.16) 
cé OL oxy OL— OLk 2 c 
1 2 (aU*\2 6 /aU\2 16 0U OU; 
whee Sv(-ob=3(Z2) tala) + wom oe 
2 
me [ _ a) (68.17) 
C8 \ oz, OX 


and U* is the quantity given by (67.11). Since U enters these equations through 
terms of sixth order it can be replaced by U* as was done in (67.30). Then the 
only first derivatives contained in the N” are those of the four quantities U* 
and U, and all the N“’ are homogeneous quadratic functions of these first 
derivatives, with constant coefficients. This represents a very great simplifica- 
tion of the exact equations (68.13). 

With the use of the expressions found we can immediately write down the 
approximate form of Einstein’s equations ; we have 


=v 1 1 oa%q 
(=2) ce — 198) = 7 aor — 5 


» OI pws 
53 ai + Nw = rae T* (68.18) 
where the N+” have the values (68.14) to (68.16). 

In order to use equation (68.18) for the determination of the g"’ the quantities 
U* and U; appearing in the N“’ must first of all be evaluated to the accuracy 
required. 

As regards the U; they enter into the expressions (68.14) to (68.16) only 
through terms of sixth order ; it is therefore sufficient to know them to the 
accuracy reached in Section 55. The relevant equations were also given in 
Section 65. (These equations can be obtained from the equation (68.18) with 
u =0, v=7 by neglecting N% and the second time derivative, replacing g 
by — c*, retaining only the leading term in 7% and expressing g™ in terms of 
U;.) We have by (65.05) 


AU, =—_— Ary ov; (68.19) 


On the other hand U* appears in terms not of sixth, but of fourth, order and 
it must be known with greater accuracy. By the definition (67.11) we have 


1 
Ut =U + <(S + See — 20%) (68.20) 


According to (67.04) the derivatives of U + S/c? are equal to the derivatives 
of cg%/4. Therefore equation (68.18) for » = v = 0 gives: 


A 1 @ 1 c4 
( -5 a) (u +5 8) = — FN + dmg oo (68.21) 
Further, the derivatives of Syx/c? and of cq*#/4 are equal. Putting p =v=k 
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in (68.18) and summing over k we get 
1 a?\1 Cc 
(4-5 aa) ga See = — 5 NH — dy wee 


(In the second term on the right we have replaced (— g/c?) by unity.) Finally 
we can write 


1 a?\ /—2U2 4 
Ral a) | =F (grad Ut + 1émyUT (68,23) 
Here we have neglected the second time derivative in the d’Alembertian and 
have used the identity 

A(U*) = grad U)? + 2U AU (68.24) 
and also the equation 
AU = —4ryp = — Arye? T° (68.25) 


The required equation for U* may be obtained by adding the three equations 
(68.21) to (68.23). We calculate the sum of the right-hand sides. By (68.14) 
we have 


Nae feed Ue 68.26 
— + No = = (grad U) (68.26) 
Calculating to the same accuracy we obtain from (68.16) and (68.17) 
Cs 1 
as Nek — oe (grad U)? (68.27) 
and therefore 
“oo 4 © yee 4.4 (grad U)? = 0 38.28 
en says + (gra P= (68.28) 
Using this relation as well as the equation 
Anyy -+- 16nryU = -— 4ryc? (68.29) 
we get for U* the simple equation 
1 o2U* 
AU* — — —_ = — Ary(c? Pf + The) (68.30) 
c2 at? 


Outside the masses U* satisfies d’Alembert’s equation with Euclidean 
coefficients. This could be expected in yiew of the form (68.17) of the Lagrangian. 

At the end of Section 67 we found an approximate expression for the scalar 
of curvature 2 in terms of the function 


G2 a (=2) | (68.31) 


which is proportional to the fourth root of the absolute value of the deter- 


minant g. We have, approximately 


raz (- OL af) (68.32) 


 f8 \e? at? 
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If this is combined with the relation 

R= at T (68.33) 
which follows from Einstein’s sgiaicne and in which 7 is the invariant of 


the mass tensor, the preceding equation gives 


1 @ Ary 
-5 = =—- S/T (68.34) 


Let us compare this relation with the expression (68.30) for U*. We put 


p= (68.35) 


1 
where Us — U +5 (8 — Sex — 30% (68.36) 


as a comparison with (67.09) shows. Like U* the quantity U** is equal in 
first approximation to the Newtonian potential U. Equation (68.34) now 
assumes the form 
27 Pe 38U 
AU** — ee = — tor(1 + =z)? (68.37) 
c 


c® at? 
To the same degree of accuracy we can write 


1 e2t** 
AU** —. 3 a = — Ary {(c? + Ujre — TE} (68.38) 
c 
This last equation can also be derived directly from (68.21) to (68.23) and from 
the definition (68.36) for U**, 
If the invariant T has the form (66.19) we have 


3U 1 3 
(1+ 5) r=e(1 +5 (fo +I] — Sp (68.39) 


By use of this equation together with (68.37) it can be shown that in the approxi- 
mation considered the quantity U** is an additive function of the masses. 
It is easy to see that the mean of (68.20) and (68.36) 


0 = 4\(U* + U**) (68.40) 
satisfies the equation 
1 eu 
0 aye = ee 4 (68.41) 


This equation has only one component of the mass tensor on its right-hand 
side ; it is related to the Einstein equation involving the same component, 
while the quantity U itself is related to g®. Indeed, by multiplying together 
the expressio Lot +U* [*#)\ 2 

Barge vVi-9) = (2 +=] 


00 


(68.42) 
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and remembering that U* and U** differ from one another and from U by 
terms of order 1/c? we get 


7 ] 4 7 
V(—9):9% = g=-+—=0+4-02 (68.43) 
c 3 0 


where U is a solution of (68.41). 


It is not difficult to verify equation (68 43) for g. For if g°° has this value, 
then 


1 1 92,00 rT 
5 hate — os 8 2 4 (nn 1 20) 
c 


2c? ce et? 


+3 ((eaa oy — 2 (=) ‘) (68.44) 


Here we can replace U by U in the correction terms and we can discard 
(0U/at)?. Using the value of N given by (68.14) we obtain 
Le 1U 1 aU 


* Ago gh + Noo — 2 (1 Av 
a8 38 eet =, all ae 


8 — 35a ) (eas 


If we compare this relation with Einstein’s equation (68.18) for y = v = 0 we 
arrive at (68.41). We may note that to the same order of approximation to 
which (68.43) holds we can put 


o-2e8)/6-9 am 


For the case of a static field of a concentrated mass M we obtained a rigorous 
solution in Section 57. A comparison with that rigorous solution shows that if 
one inserts 

yM 
7 


Vee oem (68.47) 
into equations (68.42) and (68.46) they reduce to the exact expressions (see 
(58.10) and (58.13)). The expressions (68.47) agree with d’Alembert’s equation, 
which indeed they must satisfy outside the masses, in the approximate theory 
considered in this section. 
, 69. The Connection Between the Divergence of the Mass Tensor and the 
Quantities Iv 

We have already mentioned at the beginning of the preceding section that 
if in Einstein’s tensor the terms containing the I’ are omitted at the outset, 
the divergence of the mass tensor will vanish, i.e. 


V,T# = 0 (69.01) 
only provided the harmonic condition 
uv 
og” = 0 (69.02) 
Ox 


ue 


is satisfied. In order to study the equations of motion it is necessary to examine 
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the connection between the conditions (69.01) and (69.02) or, more precisely, 
between the left-hand sides of these sets of equations. 

If no terms are neglected the left-hand side of (69.01) may be written as a 
rather complicated differential expression involving the left-hand sides of 
(69.02). However, we are interested not in these rigorous relations but in 
approximate ones that are valid to the accuracy to which Einstein’s equations 
were given in the preceding section. We now pass on to the derivation of these 
approximate relations. 

The expression for the divergence of a symmetric tensor was developed in 
detail in Section 65. According to equation (65.24) the spatial components of 
the divergence are 

eT = aT 
Vad = St ae Toe + TET + yo I + Vy TH+ y,T% (69.03) 
The Christoffel symbols and the quantities y, appearing here have been given 
in Section 67 (equations (67.25) and (67.21)). We now transform those equations 
a little. We introduce the quantity U* following (67.11) and can then replace 
the first equation of (67.25) by 


4U\(aU* 4 
ri, = -(1 2 =z) (= i =, (69.04) 


c? CX; c? ot 


The remaining equations of (67.25) can be written as 


2 (0U ou; 
i= 19 eS ee 
oe = 2 YodR — ( = a) (69.05) 
and 
1 eU 
Te, = 2(ydix + yxdus) — = —— 8x (69.06) 
c* Oz; 


Inserting these expressions into (69.03) we obtain 


: ore = aT tk 
Ve = ey 9 eee + 2yoT? + 2y, TH 


-(1 -S\(S 4 = )re 4 (= oe 2. ey 


c2 x4 ce at ~ @ rE = Ox; c2 Oxy, 


C2 
(69.07) 


[kk 


We multiply this expression b 


the determinant a ae . 
given by y Tminant g which is approximately 


ii o*(1 S| (69.08) 
c , 


and we use the fact that the coefficients of T° and of Tkk 


to each other. We get are nearly proportional 
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a(gT®) | ag) 


V Tw — 9) 
ae ot OE 
aU* 4 0U; ‘ aU, aus 
—— 4+ — —_"}(g2T00 4 Pee) 1 4f 2 * k ; 
( Ba | 8 Ol Jee gare (= a)? 02-09) 


This is an expression for g times the divergence of any symmetric tensor. 
However, if 7’ is the mass tensor the following equations hold : 


1 @?U* 
AU® — — —_ = — {vy 27°00 kk ; 
cat sa Aa a (69.10) 


They were established above (equations (68.19) and (68.30)). With their use 
the right-hand side of (69.09) can be transformed further. We consider to this 
end the quantities N¥’ introduced by (68.14)—(68.16) and note that in the 
gravitational equations these quantities combine with the tensor T+” in the form 


4 
A’ = 9T” — oe Nw (69.12) 
Sicy 
We form the sum of derivatives 
oA’ 
At= (69.13) 
on, 
Putting for brevity 
aU* au 
= -=¥ (69.14) 
ot O23 
we get 
aNe aeNte 2/0U* 4 aU; 1023U* 4 0¥ 
ro Se Se 
ot OLE c4\ day c? at c2 at? c? at 


é or 
+5 (Fe Fe) (4% -Z) (69.15) 
OX Ox, 


Since to first approximation the relation 
aU Us 
iSclace + — 
ot OCLs 

is valid (see (55.42)) the quantity ‘¥ will be at least of order 1/c? so that in 


(69.15) terms containing derivatives of ‘’ may be neglected. Discarding such 
terms and using (69.10) and (69.11) we obtain from (69.15) 


=0 (69.16) 


ct (aN — aNt 
Srey ( ee sar) 
1aU* 4 BUN amon, amr 4 (QUE sty 
eas Uae et) pene ae —— — ——|T%* 69.17 
= ae c? x er gaa ae Ory oxy ( ) 


Now, the right-hand side of this equation coincides with the additional terms 
on the right of (69.09). Thus the latter are expressible as sums of derivatives. 
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Therefore, if 7’ is the mass tensor we have 
vr = 2 (gp © yo) 4 2 ( qe ne) (69.18) 
ae = (9 Sry OX, J 8rry 
On the other hand differentiation of the gravitational equations written in the 
form (68.18) gives 


1 @ agi agt* 
(a =a zi) (= + a) 
16ry {0 ee a ( é* 
= — {— (gT? — —_ N® — (gT* — — N& 69.19 
a fe (0 Sy) | Ome 9 Bry i098) 
Comparing the last two equations we may write 
1 &)\ (dg? dg 16rryg 

seg St, Be | Vote 69.20 
(a c? =) ot a =| é e ( ) 
This approximate relation plays an important role in the derivation of equations 

of motion. 


A similar relation may also be derived for the zero component of the diver- 


gence of the mass tensor. Differentiating the approximate Einstein equations 
(68.18) we find 


2 op ou. 
1 (s 1@ ) dg™ aN% Bry a (gT%) (69.21) 


2e\" 8 at 
To evaluate the second term of the left-hand side of (69.21) we differentiate 
the expressions (68.14) and (68.15) for N° and N%, Using (69.16) we get 
oN°o @eN% 6 0U 8 aU 
2 ee ae — — AU 69.22 
at 0x4 c&§ at el c§ 0x4 ‘ ( ) 


Here we may express: the quantities AU and AU, in terms of T and T% 
by use of (68.25) and (69.11). Then we have 
aN Bry U 4 _ B2ry WU ry 
Ox, c* sat c* aay 
Inserting this into (69.21) and using the expression (69.08) which gives the 
determinant g in terms of U, we obtain after multiplication with 2c 


on, Ox, c4 ox, 


(69.23) 


1 6%\ag"% léry /aT™ 1 OU 
A — —=—|—— = — gf —— + — — 7 
“sala, ole tam) 2 
Instead we may put 
1 o2 dg™ 16ry" 
Ay ea Re ee ou 
( rs =a) ae gV LT (69.25) 


for, by (65.23), the zero component of the divergence of the mass tensor is 
equal to the bracketed expression on the right-hand side of (69.24). Equation 
(69.25) has a structure completely analogous to that of equation (69.20) for the 
spatial components of the divergence of the mass tensor. 

Both equations may be combined as 


1 8) dg 16 ry 
A—~=—)—- = — uy 
( Sag (69.26) 
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As was stressed at the beginning of this section these equations are approximate. 
They are remarkable because the differential operator on the left-hand side has 
constant coefficients ; consequently the equations can be very conveniently 
investigated. 


70. The Equations of Motion and the Harmonic Conditions 


The problem of the motion of a system of masses has already been formulated 
generally in Section 64. Now we must investigate the form of the equations of 
motion. 

We know that if each moving body is regarded as a continuous medium then 
within each body the equations 

V,2e = (70.01) 
must be satisfied. On the other hand one can consider the motion of each body 
as a whole and characterize it by a finite number of parameters such as the 
coordinates of its mass centre, the values of its mass and its moments of inertia 
and so on. 

The problem then arises to find the equations of motion of bodies as a whole, 
i.e. to find differentia] equations for the parameters characterizing each body. 

We have just enumerated some of the parameters characterizing a body. 
What is it that determines their choice? In the first place it must be required 
that the parameters chosen are sufficient to find the forces which are exerted 
on other bodies and which determine their motions. In our problem the forees 
in question are those of gravitation. Therefore, the parameters characterizing 
a body as a whole must be so chosen as to permit a sufficiently accurate deter- 
mination of the gravitational field produced in the region containing the 
Temaining bodies. Since the bodies are supposed to be at great distances from 
each other the parameters characterizing the body must then satisfactorily 
determine the gravitational field at large distances from it. The parameters 
listed above have this property. 

What has just been said applies to Newtonian mechanics just as much as to 
Einstein’s mechanics. Since in Einstein’s theory, gravitation is related to 
the metric and it is the nictric that has immediate influence on the motion of 
bodies, the problem of finding equations of motion for a system of bodies must 
be solved in conjunction with the study of the metric at large distances from the 
bodies. Both the choice of parameters and the form of the equations of motion 
must be subject to the requirement that at large distances from each body the 
metric should be given correetly. 

At the cnd of Section 69 we derived the approximate relations 


1 @\ég = 16ry 
— — —)—— = —_ gV,T” 70.02 

(a ce mn) er, a Ite ( ) 
They follow from Einstein's equations written in the approximate form (68.18). 
In the rigorous solution both the right-hand side of (70.02) and the cxpression 
acted on by the d’Alcmbert. operator on the left-hand side vanish by virtue of 
the equations (70.01) and the harmonic conditions 


OV =i (70.03) 
or 


u 
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If, however, the solution considered is approximate, the conditions (70.02) 
establish a relation between the order of approximation in which the equations 
(70.01) are valid and that in which the harmonic conditions (70.03) hold. 

We have scen that in seeking the laws of motion of bodies as a whole a basic 
requircment is that the metric at large distances from each body is given 
correctly. This requirement means in particular that at large distances from 
all bodies the harmonic condition (70.03) must be fulfilled with greatest possible 
accuracy. Let us consider what consequence this has for the right-hand side 
of (70.02). 

These equations have the form of the usual equation for a retarded potential 


Ay — —~— = —4no (70.04) 


but it must be remembered that in the very process of establishing the equations 
approximations were made that are equivalent to an expansion of the functions 
involved in inverse powers of the speed of hight. Therefore it would be sufficient 
in this case to solve (70.04) approximately. 

Nevertheless, we begin by examining the exact solution. We assume that 
the function o on the right-hand side of the equation, which we shall term the 
density, differs from zero only in a limited region of space in the neighbourhood 
of a point having the coordinates 


a= ait) (i= 1,2, 3) (70.05) 
(This is the region occupied by one of the masses.) We are interested in that 


solution of the equation which corresponds physically to the potential pro- 
duced by the moving mass of density o, ie. the retarded potential. This solution 


has the form ; [o] dV’ 
i | ae (70.06) 
where [c] is the retarded value of the density, i.e. : 
, ‘ ‘ 1 i 
[o] = oft’, r’); Cis |e | (70.07) 


The integration is over the coordinates (z’, y’, 2’) and extends over the region 
occupied by the mass. 
We now pass to the approximate equations. Expanding [oc] in inverse powers 
of the speed of light c and retaining only the first few terms, we get 
rp o Al” ld 1 @ 
el (mace Peeianee ran eee 
Seera “al Yt yaa 


t 


C~ 


| Jr —r'JodP” (70.08) 
where now , 
c = a(t, r’) (70.09) 
This expression can be further expanded in inverse powers of the distance 
from the mass in question. 
Using the expansions 


eee ae (x, — a,)(x; — a,) 
[r—r’{ |r--a |r —al +... ae 
Pp] =[r—a[— Sa oa) | 


ips 
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and putting | o(t,r’)dV’ =u 


: (70.11) 
[ ol, Fa; — ai) dV’ = ps 
we obtain 
ee eee pi(xe — At) 1 du 
» [F—a|* |r —al? Sa 2 
1 @ pa(xe — a4) = 
taagatl? ~al—Ae t (70.12) 


The quantities p and yp, may be called the “moments” of zero and first 
order. It is casy to see that if one puts yp. = 0 the terms not containing c will 
decrease at least as 1/ |r — a|?, the terms inversely proportional to c will vanish 
and the terms proportional to 1/c? will remain bounded. If, in addition, it is 
postulated that the first moments vanish, i.e. u; = 0, the terms next in order of 
importance in the expansion (70.12) will also vanish. If the second moments 


| o(x4 = a1)(LE = ax) dV = Pik (70.13) 


are also zero, further terms vanish, etc. In general not all these conditions are 
independent. But we can impose as many independent conditions as we have 
parameters at our disposal. 

We assume now that there are several masses so that the density o differs 
from zero in the vicinity of scveral pomts, say , 


Mo a(t) ; y= b,{t) nati (70.14) 


Then the samc reasoning will apply to each point. Using the notation 


yi) = [ o(t,r)dV; pO) = | o(t,r) dV... (70.15) 
(a) (b) 
and oS ‘= a(t, r)( —a ) dV ; ui?) = a(t, r)(z. = b,) dV (70.16) 
t 4 4 7 
(a) (d) 


for the integrals (70.11) extended over the regions occupied by each of the 
masses separately we find for } the expression 


wr(z, —a) 1d (a) 
ve > eoatt Da |r —ale ~ edt De 


(Hla — 
+> Hole al We 4 (70.17) 


2c? dt? |r —al 
a 


In order that ) should be small at any point between masses and also at a 
large distance from all masses we may now require that a series of equations 


pO =0, p= (70.18) 


, baat Bb] 


should hold, each of which refers to a single mass. 
After this general discussion we return to the study of equation (70.02). 


266 The Theory of Space Time and Gravitation 


We shall try to satisfy the condition that at any point between masses, and also 
at a large distance from all masses, the harmonic condition (70.03) be satisfied 
with the greatest possible accuracy. 

When equations of motion are stated in four-dimensional form the equation 
expressing energy balance usually appears as a consequence. of the remaining 
equations of motion. Therefore we shall first deal with the spatial components 
in the relation (70.02), corresponding to v = 1, 2,3, and shall later verify that the 
conditions involving the temporal component v = 0 are also satisfied. 

Comparing (70.02) with (70.04) we may put 


o =o, = —gV,T* (70.19) 
c& ag 
d —— === . 
an y ve ty an, (70 20) 


To make the quantities (70.20) small outside the masses it is necessary to satisfy 
a set of conditions of the form (70.18). In the first place we require the following 
integrals over the volumes of the separate masses to vanish 


po = — { gV 1 “dz,dx,dx, = 0 (70.21) 


But, by (69.09) 


a a au* 4 aU, 
VT = —(gT) + —(gTtk (= a) 
9 PAG )+ ar ee Be oe (77T™.-- TF) 
aU, Ue 
rh (incl Ama 
+ ( : 2) (70.22) 


Therefore the equations (70.21) may be written as 


d , aUu* 4 aU. 
Pos 40 3 . ¢ 
zi | gT(dz)3 = | (= + a Ferre + T*)(dz)3 
(a) (a) 
aU, au, 
Se | (Fe = 5a) T*(dax)8 (70.23) 
where we have put for brevity ° 
(dz) = da, dz, di, (70.24) 
We should also remember that 
| g=—c? —4U (70.25) 
Expressions for the components of the mass tensor of an elastic body were 
obtained in Section 66. According to (66.07) they have the form 


1 
em — oll +500 + — 0) 


= 1 1 
erm en + oa (20? +I — v)| — 5a Penre (70.26) 


oe? Ttk — ove — pix 
If we confine ourselves to leading te i i 
th i 
aT nas : g terms : a of inserting (70.26) into 
— | ov,(dz)? = — 
i| e (dz) | ae (dx) (70.27) 


a) 
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Hence it is clear that the relations (70.21) represent the equations of motion 
of the mass centres of the various masses. If the number of masses is n there 
will be 3n such equations. This will be the number of degrees of freedom of 
our mechanical system if the masses are treated as point-like. 

Passing on to the discussion of further conditions of the form (70.18) we can 
postulate that the following combinations of first moments vanish 


up = — [ 9la,V,To* — 0,V,Tat\(da)? = 0 (70.28) 


If here also we confine ourselves to the main terms, equations (70.28) assume 
the form 


d a a 
i | O(L%% — rev4)(dx)? ah once — Zp tae (70.29) 
Evidently they represent the law of the variation of angular momentum for 
each body. Here the angular momentum includes both the orbital moment 
arising from the motion of the mass in its orbit and the intrinsic moment arising 
from its rotation. The orbital moment can be separated out by suitably com- 
bining equations (70.27) and (70.29). The number of equations of the form 
(70.28) is also 3n, therefore if it is assumed that the masses rotate as rigid 
bodies the number of equations is equal to the number of rotational degrees 
of freedom. 


71. The Internal and the External Problems in the Mechanics of 
Systems of Bodies. Newton’s Equations for Translational Motion 


In the following we shall distinguish between the internal and the external 
problem of mechanics. We shall relate the equations of motion within a body to 
the internal and the equations of the motion of a body as a whole to the external 
problem. 

We shall consider in various approximations the influence of the internal 
structure of a body on its motion as a whole. 

We saw that, in order to derive the integral forms (70.27) and (70.29) of the 
Newtonian equations of motion, it was sufficient to use the equations of the 
internal problem to the extent of the equation of continuity 


GE si AOD carp (71.01) 

al Oxi 
which determines the zero approximation for some of the components of the 
mass tensor, namely 

(cd alee c2T% — oy (71.02) 

Thus, in the approximation in question it was hardly necessary to make any 
assumptions concerning the internal structure of the body. Incidentally, a 
more detailed discussion of the internal structure of a body is itself impossible 
without knowledge of the metric to a corresponding accuracy. If one calls 
the Euclidean metric the zero approximation, then the first approximation 
beyond it already requires the introduction of the Newtonian gravitational 
potential U and also of the vector potential U;. This is what was done in Sec- 
tion 55 on the basis of the expressions (71.02) for the mass tensor used there. 
Physically, the first, approximation to the metric corresponds to taking account 
of the force of gravity in the study of the internal structure. 
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In Section 66, when we derived the mass tensor (as given above in (70.26)) 
we required not only the first approximation to the metric but also some 
definite assumptions as to the internal structure of the body. In fact we took 
the body to be elastic. In addition to the equation of continuity (71.01) we 
had to assume that the Newtonian equations of motion were satisfied within 
the body : 

Out ov; aU Optik 
(3 na) es eras (71.03) 

The equation of continuity for the internal problem enables one to derive 
the Newtonian equations of motion (70.27) for the external problem. On 
the other hand the non-relativistic equations (71.03) of the internal problem 
and the mass tensor (70.26) based on them ensure that in the external problem 
one can obtain the second (relativistic) approximation to the equations of 
motion of the body as a whole. This is quite natura], since the non-relativistic 
equations give the distribution of energy and of related quantities within the 
body, and energy possesses mass which influences the gravitational field in 
relativistic approximation. 

Our next problem is to obtain in an explicit form the equations of motion 
that follow from the integral relations established at the end of the preceding 
section. 

To construct these equations it is first of all necessary to state what degrees 
of freedom it is intended to study, or in other words by what parameters the 
mechanical system is to be characterized. The considerations that determine 
the choice of parameters have already been stated at the beginning of the 
preceding section. Based on these considerations we shall discuss those 
degrees of freedom which correspond to, first, the translational motion of each 
body separately and, second, the rotation of each body about its mass centre. 
In doing this we shall assume that the bodies rotate as if they were rigid. (Of 
course, this does not mean that the bodies are assumed to be rigid ; fluids 
can also perform such a motion.)} 

By virtue of the equation of continuity the mass of a body 


Ma= { o(dzy? (71.04) 

(a) 
is. a constant ; therefore it is not to be numbered among the variable para- 
meters. The translational motion of a body is determined by the change in the 


coordinates a; of its mass centre. These quantities may be introduced by the 
relations 


May = | ox,(dx)? (71.05) 
which may also be written as me 


| ela — an(dxys = 0 (71.06) 
(a) 


t We note that our calculation is of an approximate nature. The difficulties connected 


with defining the conce t of & TL id O n the h Vv. oO not et arise In the 
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Differentiating the integral (71.05) with respect to time and using the equation 
of continuity, we obtain 


d 
7 | ex,(dx)? = { 7 z(dx)? = — | = a(dx)? = | ev,(dx)? (71.07) 


J 


whence | pv(dz)? = Mad; (71.08) 
(a) 


It follows from (71.08) that regardless of the distribution of velocities within 
the body the left-hand side of equation (70.27) is equal to the product of the 
mass of the body and the acceleration of the mass centre. ~ 


d 
= | ov(dz)® = Mads (71.09) 


Let us calculate the integral on the right-hand side of (70.27). The Newtonian 
potential U may be split into two terms 


U(r) = ua(r) + U(r) (71.10) 
Here wg is due to the mass M, while U‘ is due to all the other masses. In 
accordance with this splitting we get 


aU bua au® 
— (dz)? = — (dx)? ——— (dx) 1.11 
[es@er= fetta + fea may 
(a) (a) (a) 

Within the mass M, the potential ug satisfies the equation 

Aua = — 4rypa (71.12) 
where pz is the density associated with that mass. The potential may be written 
as the. integral 


fe a = (71.13) 
On the other hand the potential U ce which originates in the other masses 
will be a slowly varying function of the coordinates within the mass under 
consideration and it may be expanded in a Taylor series in powers of (x; — ay). 
This makes it possible to evaluate approximately the second integral in (71.11). 
Inserting the expression (71.13) for ug into ane first of the integrals we get 


: a ata fl eae — xj)(dx)(da')> (71.14) 


Cx; 
(a) (a) (a) 


but the double integral on the right is zero because the integrand is anti- 
symmetric in the coordinates of the two points. Thus 


i : os (dzy> = 0 (71.15) 
Lt 


(a) 
This relation can be interpreted as the statement that the resultant of the 


internal gravitational forces vanishes. 
We give another proof of this relation. We put 


g@ = Brx(grad ua)? — — — (71.16) 
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Then 


Oe a Ay, (71.17) 
Ox, Ox4 

or 
1 og Ma (71.18) 


Ary “OxE Pa CL4 
as a result. of (71.12). If in the integral (71.15) we take the density p to be 
ea, the density of the particular mass under consideration, we can extend the 
integral over the whole volume of space, which gives 


ou OU, 1 lee 
Ss ee 71.19) 
| ea Bat, = (dx)? = Pa =— BE: trey A Ga, | ( 


(a) (20) 
since at infinity the g{? become zero. 
We now turn to the evaluation of the second integral in (71.11). Expanding 
the potential U' of the external masses in a Taylor series we get 


eum aU - 1 92qJ a) 
= () (~—) (te — ax). 
0x4 Ox; a ex on - 
1 Ua) 
2\ day Oxy Oxi) a" - Le (71.20 
ss Ore wz) (tu @x)(x1 — a1) + ( 


Multiplying this expression by p, integrating over the volume of the mass 
(a) and using (71. Ge we obtain 


3 _ J@{— ee 71.21 
i ° Ba, ~ (az) ud Ax “). +3 Th (= Ore =), . ( ) 


{ 
where J denotes the quantities 
Ip = | (a)p (we — ax)(a1 — ai)(dz)® (71.22) 


which we shall call the moments of inertia of the mass (a). (In mechanics this 
name is applied to somewhat different quantities.) We now have to find the 
value of the potential U'), We have 


Ulan) = > | —? _ (dx')3 (71.23) 
Pe 
rn () 
where the prime attached to the summation sign signifies that the mass (a) 
is excluded from the summation. Each term in the sum represents the potential. 
due to the corresponding mass. We consider a single term such as 


-_ e'(da’) 
ur(r) = | ror] | (71.24) 
We insert the expansion 
1 1 ; 7) 1 
oF] =e ae FB 
1 : o 1 
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integrate term by term and use (71.06) and the definition (71.22). This gives 


yMo 1 a? 1 
= SS ee ee : 
me) |r —b| a 2H! Gay On |r —b| + 28) 


Therefore we have 
, M 1 ’ oe 1] 
Urey) = > en +5 > +... (71.27) 
b b 


Let us estimate the order of the terms in the two sums occurring here. Let 
I be a length that characterizes the linear dimensions of the bodies and R a 
length of the order of magnitude of the distances between them. (See Section 
64.) Let q characterize the order of magnitude of the velocities of the bodies. 
We shall take it that in orders of magnitude 


yM 


gris (71.28) 
The order of magnitude of the moments of inertia will evidently be given by 
In ~ ML? (71.29) 


therefore the first sum in (71.27) will be of order g? and the second of order 
q? L?/R?. The omitted terms indicated by the dots will be of yet higher order 
in the small quantity L/R and we neglect them. On the other hand it is readily 
seen that in (71.21) the second term will be of order L?/R? relative to the first. 
Therefore when calculating the second term we can discard terms in U‘) that 
are of order q? L?/R? whereas they must be retained in the first term. With a 
little manipulation we get 


aU) a ~wv {yu oe 1 
fo ans = 2 a ane + MI) 
b 


ont Bai |a —b| Gaz 2a; |a —b| 
” (71.30) 
Since the quantity . 
oe? 1 ae Sk 3(ax =e bx)(a = bi) (71.31) 
Gay Oa; |a —b| |a —b|? |a —b|5 


is symmetric in a and b the expression in curly brackets in (71.30) will also 
be symmetric. Introducing the double sum 


M el 
6 peated + F(M,IQ + MQ) ——— en (71.32) 


2 < |a —b| day Ga; |a — b| 
(a¥b) 
we can therefore write 
aU aU @) o® 
as 3 — —— (dz? = —— 71.33 
[eZ cane = | oS ar =F (71.33) 
(a) a) 
Equation (70.27), which we restate, 
d aU 
_ 3 — (dz)? 71.34 
© [coud = | oo (an (71.34) 
(a) (a) 
then assumes the form 
om 
Mase] (71.35) 
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The quantity ® is evidently the Newtonian potential energy of our mechanical 
system expressed in terms of the coordinates and the moments of inertia. We 
note that-enr derivation has as yet not required any assumptions concerning 
‘the distribution of velocity within the body, apart from what is implied by 
(71.01). 

If all bodies under discussion possess spherical symmetry the inertia tensor 
Ix; of each body will be proportional to the unit tensor 


Te = [8x1 (71.36) 


In that case the potential energy will not contain the moments of inertia at 
all, but will depend only on the coordinates a; (and, of course, on the constant 
masses M,). Then the system of equations (71.35) is fully determinate : it 
contains as many equations as unknown mass centre coordinates. It is just the 
usual system of equations for the motion of a set of mass points subject to an 
attraction given by Newton’s Law. 

In the general case, on the other hand, when the bodies are not spherically 
symmetric the equations of motion involve not only the mass centre coordinates 
of the masses but also their moments of inertia, and the system of equations 
(71.35) is then not fully determimate. In that case a determinate system of 
equations may be obtained if it is assumed that the bodies rotate about their 
mass centres in the manner of rigid bodies. This means that the distribution 
of velocities within each body has the form 


v, = & + ox, —a,) (71.37) 
where w{? is the three-dimensional antisymmetric tensor of angular velocity of the 
body (a). Omitting the label (a) we can alternatively write its components as 


Og3 = 1; O31 = W2; O12 = W3 (71.38) 
Under such an assumption the complete system of equations is obtained by 


adjoining to (71.35) some equations which prescribe the rates of change of the 


angular momentum of each body. The derivation of these equations will be 
given in the following section. 


72. Newton’s Equations for Rotational Motion 
We put 


MY = | e[(x, — a,)u, — (% — a,)v,](da)? (72.01) 

(a) 
Evidently, this is the angular momentum of body (a) referred to its own mass 
centre. The law governing the change of this quantity can be obtained from 


(70.29) after separating terms referring to the orbital angular momentum. 
This is effected by use of the relation 


d 
zi {a { ovx(dx)® — az { en(da = 4; { ° ae (dx)? — ay { ) al (dx)? 
(a) (a) ae a 
a (a) (a) 


(72.02) 
which follows from the equations of motion (71.34) and from equation (71.08), 
by virtue of which 


a [ pue(dx)? — dy | pvd(dx)? = (72.03) 
(2) (a) 
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We restate equations (70.29) 


d aU 7) 
a | (Lie — xxr1)(dz)? = i o( x 7 ad (dz) (72.04) 
(a) (a) oe - 


Subtracting from them the relations (72.02) we get 


d 
di { el(x: — a4)ve — (xe — ax)vi}(dz)? 


(a) 
oU aU 
— —? ‘ So =< ree eel 3 
of (x a) 5 — (tte — a) = | (da (72.05) 
G, 
or, using the definition (72.01) : 
d oU aU 
— M@® — ee fo =2 =: 7 
a it a ai) on (x — Gx) se, | (ae (72.06) 


(a) 


We now calculate the quantity A/{®. Inserting the expression (71.37) for the 
velocity into (72.01) and using the definition (71.22) for the three-dimensional 
inertia tensor we get 
MO = ol — ofl (72.07) 

In more detailed notation these relations appear as 

Moy = (Ln2 + 133) O23 — LyeWar — 113642 

May = T2053 + a3 + Lis)ar — L291 (72.08) 

M yy = —L 4323 — L391 + Lr +L e2)ore 
For brevity the label (a) has been omitted. These are the well known equations 
in the mechanics of a rigid body, which relate angular momentum to angular 
velocity. 

We now evaluate the right-hand side of equation (72.06) ; it evidently rep- 
resents the moment of the forces acting on the body. In the sense of the separa- 
tion into external and internal forces that was given in (71.10) let us first consider 
the moment of the internal forces. With the aid of (71.18) it is easy to verify 
that it vanishes. For by the same argument as was used to derive (71.19) we 
obtain the equation 


OUa Qua 
PE cel, ae sa gah eee NS 
| f(x a) oan (xp — Gx) oe: (dz) 
(a) 


1 3 .. 
} aur; {(4, — a) — (%, — %)8P Hazy? =0 — (72.09) 


0) 


arr pea 
uae 


The moment of the external forces remains to be calculated. Using the expansion 
(71.20) of which only the constant and ine linear terms need be retained, we get 


Um ou a) 
{ a — a) — (4% — ax) a | aay 


Or k 
(a) 
2T] (a) 20 (a) 
o ro ( 2 a 19(5 e ) (72.10) 
u ork Ox; a OL4 Ox; a 


We insert into this the expression (71.27) for U™, of which only the first sum 
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need be retained. This gives 
aU @ au 
i o| (x a) (te — a) 
(a) 
’ e? 1 oe? 1 
_ (a) a (() aan CO 
Sng Gaz Gay [a —b| *I ag day |a —b| ( 


Since by (72.09) the moment of the external forces is equal to the total moment, 
we may replace U‘ by U; then (72.11) gives the right-hand side of (72.06), 
so that we may write the law for the rate of change of the angular momentum 
of body (a) asf 

d r a? 1 che 1 
as (a) — (a)_ LO) Ln 72.12 
dt Mi > ym, {Ig Caz Oay la — b| kj Oat 0a; [a = b| ( ) 


| ae 


Introducing the explicit expression (71.31) for the second derivatives we can 
also write 


d 1 3yM(a; — b 
= Ue = > ee La, — LP — (a, —b)I@] (72.18) 
b 


These equations supplement the previously derived equations of motion of the 
mass centres 


Mat = — — (72.14) 


and render the system of equations determinate. That this is so may be seen 
in various ways. We can consider as our unknowns characterizing the motion 
of each mass the quantities 


a 19, «w@ (72.15) 
twelve quantities in all. (The components of angular momentum can be 
expressed in terms of these by equation (72.07)). For these quantities we have : 


the three equations (72.14) (the mass centre motion), the three equations 


(72.13) (the law describing the change of angular momentum) and, in addition, 
the six equations 


ae 1@ + @@y@ 
at — Og jk ae Ox i (72.16) 


which hold for any tensor whose components are constant in a coordinate 
system fixed to the body. (These equations will be derived below.) Thus we 
have twelve equations for the twelve unknowns, so that the system of equations 
is fully determinate. 

We could also have argued differently by considering for each body a co- 
ordinate system (zy, 27, 23) rotating with the body (we again omit the label 
(2) for brevity). Denoting by «,, the direction cosines which satisfy the 
relations 


Orectry = 84g; kr tgs = Sys (72.17) 


+ This equation was first derived from Einstein's e uations (by anoth 
V. P. Kashkarov [39]. q (by another method) by 
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we may put 


i Op 4(, — a); L, — a, = a, ,2* (72.18) 
The derivatives of the direction cosines are related to the components of the 


angular velocity by the relations 


re = Arjje 5 jt = Ar jOri (72.19) 

By well-known relations in the kinematics of a rigid body the nine direction 

cosines may be expressed in terms of the three Eulerian angles 3, © and y. 

For each of the masses we could have replaced the unknown functions (72.15) 
by the six quantities 

By, Ay, Az, '@), gla, yo (72.20) 


and could have expressed all other quantities in terms of these, in particular 
those given in (72.15). For instance, (72.19) already expresses the angular 
velocity in terms of the direction cosines and their derivatives. As for the 
moments of inertia, they may be expressed as 


Tyg = typ%q1% (72.21) 


sj zs 


where the /* are the constant values of the components of this tensor in a 
coordinate system fixed in the body. For the six quantities (72.20) related to 
each mass we would then have six equations (72.13) and (72.14), i.e. the required 
number of equations to make the scheme determinate. 

The calculations just described of the equations of motion in Newtonian 
approximation presuppose the possibility of discarding (in addition to relativis- 
tic corrections) all terms of higher order in the small quantity L/R. If we wished 
to retain these terms we would have to discuss third and higher moments 
such as 

ikl 
(a) 
However this would not destroy the determinateness of the system of equations, 
for we could regard the additional quantities introduced as functions of the 
Eulerian angles, e.g. 


1@), = i e(x, — a,)(2, — a,)(2, — a,)(dx)3 (72.22) 


Lig = aX seu! rou 
where the 7*,, are constants. Then the quantities (72.20) would remain the 
only unknown functions, so that the number of these would not increase. 
Using the other procedure we would include the newly introduced quantities 
among the unknowns and correspondingly we would augment the system of 
equations by writing down, e.g. for the quantities (72.22) the equations 

d ) 9. 

Ti 1@, = wP1% + GID) + APL, (72.24) 
In the general case of an arbitrary three-dimensional tensor Ati, - - - i, having 
the constant components Az,, ...r, in the system fixed to the body we would 
have 


= Anis oe ety 0, Aj, tgs os ty a5 onda, te thoy fs thy os th 


+ cpt, Atys tore typ g§  (T2.2 
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These equations follow directly from the transformation equation 
(72.26) 


* 
Ay, Ig + = Ory trots « OrninAry re stn 


in conjunction with (72. o 
In conclusion, we verify that the law of conservation of energy is satisfied 
for the system of equations (72.13) and (72.14). We introduce the rotational 


kinetic energy of a body about its mass centre 
7, = LoPoP 1? = MPP (72.21) 


When forming the time-derivative of Tz one must bear in mind that by (72.16) 


dl 
wMuio —H 7 = 0 (72.28) 


so that in the differentiation of 7, the J Sy may be treated as constants. Remem- 
bering this, it is easy to obtain the relation 
ay yy Ail 
Ge pee at 
If we introduce the expression (72.12) for the derivative of M®) into this 
equation and if we use the antisymmetry of = wo we get 


— 1 
Sa 2 >, 
= W; a. yu b I@ 8a; Gay —— [a _ b| (72.30) 


or, after sma with aaieck to 7 and k and subsequent relabelling 


a 1 
M,( wPT@ + ohD[@ 
= 5D ah pk 3) 3a, dag [a — b| 


(72.29) 


(72.31) 
or, finally, by > 16) 


aT doe ss 
dt =3 > dt Ga; 8a, [a — b] eae 


We recall that i expression na for the potential energy was 


yMaM by a 1 
=-5 > ices =p 5 (MI + MLD) sae [ant] ; (72.33) 


(ab) 
The quantity ® depends on time through the coordinates a; and through the 
moments of inertia /{. This latter dependence leads to the following terms in 
the total derivative of ® with respect to time: 


d® oD dT a 

(ame al cama ve 
this may be seen by comparing (72. 33) and (72. 32). Hence using the equations of 
motion for the mass centres we get 


d 
5 Z (AA1,a2 + 7) + o| =0 (72.35) 


and, therefore, we have the law of conservation of energy in the form 
2~GM,@+T,) +0=E (72.36) 
a 


where E is the energy constant. Ore should note that even for a very fast 
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rotation of the bodies, when the linear velocities of the translational and the 
rotational motions are of the same order, the rotational terms in the equation 
of energy balance (72.34) still remain smaller by a factor L/R than the main 


terms, while in the integrated equation (72.36) they are of the same order as 
the main term. 


73. The Internal Structure of a Body. Liapunov’s Equation 

At the end of Section 70 we found the second approximation to the equations 
for translational motion in integral form. We now write them down in more 
detail. By inserting the components of the mass tensor of an elastic body (70.26) 
into the equations of motion (70.23) we obtain 


d - ] 1 
i | {em a oe (302 + TT + 30)| = + Paws (ds)? 
(a) 
ouU* 4 0U 
~ (a SEV + Segoe 4 — 1 — Bhaey 
(a) 


OX: 
4 ( (aU, 3 
+5 {( tS) coy( dt) (73.01) 
(a) 


Oxy x4 


According to (68.30) the quantity U* appearing here satisfies the equation 


1 a2U* ° 
5 o- —Are0 + 4 (qo +11 —U) — P| (73.02) 


In first approximation U* is equal to the Newtonian potential U, but we require 
U* in second approximation, including retardation corrections and including 
terms arising from the additional terms on the nght of (73.02). On the other 
hand it is sufficient to know the first approximation to the quantities U; which 
satisfy the equations 


AU* 


AU; = —4ryeu (73.03) 

We have pointed out in Section 71 that in order to obtain from (73.01) the 

explicit form of the relativistic equations of motion it is necessary to consider 

the internal structure of the body in the Newtonian approximation described 
by the equations (71.03) 

(5 5 a) oU Opik 


Oxt Oar 


(73.04) 


We confine ourselves to the study of those cases in which the body rotates as 
a whole, in the manner of a rigid body. Then (see (71.37)) the distribution of 
velocities within thé body has the form 

vy = a + wyi(z; — a;) (73.05) 


(We again take the label (a) on w; for granted.) Hence the acceleration of a 
particle in the body 


Ovi Ov4 
=— — 73.06 
aa ea en 
is given by 
wy, = Gi + (je — WrK@yx)(Zi — Gy) (73.07) 


We again split the Newtonian potential U appearing in (73.04) into an internal 
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and an external potential, v¢ and U‘) respectively and replace the latter by the 
first term in its Taylor expansion about the point 2; = aj. We also insert the 
expression (73.07) for the acceleration w;. Then we get 


. (=) Cua ~. 
aig ext Ja ° eat 


lial apn 
dt — OF — {——_] | (a — a;) = —_ 73.08 
cs ofan OUROK ( Bi du,) a | i) ee E08) 

We now estimate the order of magnitude of the various terms, retaining 
only the leading ones. A simple consequence of the Newtonian equations of 


motion (71.35) is that 
2 


fie ‘i (73.09) 


The quantity (0U‘@)/éx,)q in the second term of (73.08) will be of the same 
order as d:, but the difference of the two quantities will be small. A comparison 
of (71.21) and (72.33) shows that 

. aU® q? L? 

ii, — (=), ~E es (73.10) 
(For almost spherical bodies this difference will be even smaller because its 
value is determined by differences of moments of inertia and not by the moments 
themselves.) We can therefore take the first two terms of (73.08) to cancel. 
We now proceed to estimate the terms in the square brackets. Here the main 
term 1s ox w x ; it has the order of magnitude of the angular velocity squared. 
We assume that the order of the angular velocity is given by 


mee 7 (73.11) 


The order of magnitude of the time derivative of the angular velocity is deter- 
mined by the rate of change of the angular momentum. It is easy to obtain 
the estimate 

2 


~ 4 
Comparing this with the estimate for w we get 
; cL? 
ao ~ w? R (73.13) 


Further, the second derivatives of the external potential with respect to the 
coordinates will be of the following order : 


( e2U@) q? 
eee =). er (73.14) 
i.e. of the same order as &. Neglecting terms of this order} we retain in the 


square brackets only the term 4c; With these simplifications we can write 


t This approximation is made only in the equations of the internal problem (see (80.13)). 
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the internal equations of motion (73.08) in the form 


Qua Opik 
(ae H+ OtKeoge(Zy — a)| eT 0 (73.15) 
Let us put 
Ua + SOKO sE( Ly = a4)(x; ag aj) = Va (73.16) 


This is the sum of the gravitational potential and the potential of the centrifugal 
force. Equation (73.15) may be written as 


Va pik 


This equation can be satisfied on the assumption that the stress tensor within 
the body reduces to an isotropic pressure p : 


Pik = —pBix (73.18) 


(a liquid always satisfies this condition). If we had not neglected the quantities 
© in (73.08) we would have been obliged to consider non-diagonal elements 
in the stress tensor pi. Indeed it is evident that a change of the angular velocity 
of an elastic body must give rise to stresses that do not reduce to an isotropic 
pressure. 

Under the condition (73.18) equation (73.17) reduces to the following 


OVa _ op (73.19) 


The equation — — — —— =0 (73.20) 


follows from it and shows that V, and e must be connected by a relation not 
involving the coordinates, so that if p is a function of one parameter «, the 
potential Vz must be a function of the same quantity 
e= (x); Va= Vala) (73.21) 
The internal potential wa is a functional of p ; inserting its explicit expression 
into (73.16) we obtain 
dy’)3 

Y oT + 4 iKed4 (24 S ayz)(x4 = ay) = Va (73.22) 
Our problem doneiats in finding the shape of the body, i.e. the shape of the 
region of integration (a), for a given density p- This shape must be such that 
condition (73.21) is satisfied at all points within the body, i.e. such that the 
value of the left-hand side of (73.22) depends only on the density, or on the 
parameter on which the density also depends. In particular, the conditions 
a = const. and Vz = const. must hold on the surface of the body. 

If the body does not rotate, so that wix = 0, all conditions are evidently 
satisfied by a spherically symmetric distribution of density over a spherical 
body. In the case of rotation (wiz 4 0) the problem of finding the shape of the 
body is a very difficult mathematical problem which has been studied by 
many mathematicians. The most complete results were obtained by A. I. 
Liapunov who studied the near-ellipsoidal equilibrium shapes of a rotating 
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liquid, and also the question of their stability (21, 22]. We shall therefore call 
equation (73.22) Liapunov’s equation. 

If condition (73.20) or (73.21) is satisfied the pressure p can be found from the 
equation 


r dV 
or p= | p(a)n* de (73.24) 


We determine the additive constant from the condition that the pressure 
shall vanish on the surface of the body. 

The expression for the mass tensor and also the equations (73.01) and (73.02) 
involve the elastic energy per unit mass IT which, according to (30.11), is deter- 
mined by the equation 


ed 
1 = | ee (73.25) 
pp 


As a result of (73.23) the integral appearing here is just V, or a quantity differing 
from V, by a constant. This constant can be so determined that one has 

p= (Va—M) (73.26) 
This expression will be of use in deriving the equations of motion from the 
integral relations (73.01). 


74, Evaluation of Some Integrals that Characterize the Internal 
Structure of a Body 

In order to derive equations of motion from the integral relations (73.01) 
it is necessary to evaluate a series of integrals whose values depend on the 
density distribution inside a body and on its internal structure in general. 
To avoid interrupting our subsequent discussion we shall collect all such cal- 
culations of integrals in the present section. 

We begin with integrals that depend on the moments of inertia of the body 
and on its angular velocity. We denote by Q, the potential of the centrifugal 
forces 


Qa = foPol (xi — a4)(xy — ay) (74.01) 
Liapunov’s equation (73.16) then assumes the form 
Ua + Qa a Va (74.02) 
We consider the integral 
Ta= | oQq(dz)? (74.03) 
(a) 
We have, evidently, 
Pa = ufPolPI (74.04) 


so that 7; is the rotational kinetic energy of body (a). We consider also the 
first moments weighted by the function pQq, i.e. the quantities 


T= | eQa(ar —a,)(dz)>— - (74.05) 
(a) 
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Using the definition (72.22) we can write 


Tat = oQoPIg (74.06) 


These quantities vanish if the body has three planes of symmetry. 
We now consider the integral 


ca =} { pvta(dx) (74.07) 
(a) 
which represents the negative of the energy of mutual attraction of the 
constituent particles of the body and we consider also the first moments 


cat = 3 { ptta(z, — ax)(dx)5 (74.08) 
(a) 


With the use of Poisson’s equation (71.12) the quantity c, may be represented 
in the form 


1 
oe { (grad uq)%(da) (74.09) 
8rry 
(9) 


The moments eg; may be similarly transformed as follows : 


1 
cat a | (grad wa)(x1 — a1)(dz)° (74.10) 
Srey 
(2) 
Remembering the definition (71.16) of the quantities g@ 
: OUg GUa 
q@ = 48, (grad u,)? a Das. Bay (74.11) 
with q@) = $(grad u,)? (74.12) 
we can replace (74.09) and (74.10) by 
1 
Eg al q$(dz)* (74.13) 
(%) 
1 
and ae | g(a, — a,)(d2)? (74.14) 
(2) 
These integrals are special cases of the more general integrals 
1 
My 74.15 
Be = a | oertdey (74.15) 
(2) 
1 
and BY), = — | 1 (x, — a,)(a2)" (74.16) 
; ay 
(9) 
In addition to the integrals discussed above, 
cg = BO; tap = Bite (74.17) 


other useful integrals may be expressed in terms of the quantities (74.15) and 
(74.16). 
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We consider the volume integral of the pressure p 
I= { p(dz) (74.18) 
(a) AZ, 
Since the pressure vanishes outside the mass we obtain by partial integration 
a 
a { (a — a,) — (da (74.19) 
0x1 
(a) 
Using the relations (73.19) we can also write 
3l = — f ave Fs 


(a) 
However, in view of the fact that ‘Oa is a homogeneous quadratic function of 
the differences x; — a, it follows from eee s equation that 


OMe (én) (74.20) 


aVa 
— ay) —— = (mj — a1) — +22 74.21 
(24 — az) On: (x4 age = a ae a ( ) 
On the other hand we have by (71.18) 
1 aq 
jolie (74.22) 
Ox4 4ny OX, 
Inserting (74.21) into (74.20) and using (74.22) we get 
1 aq 
31 = — —— { (x; — a) He (dx)? — 2 {pata (74.23) 
Ary ox, 


Integrating by parts and using the expressions (74.03) and (74.13) for ZT, and 
Eq we finally get 


3 J p(dz)? = eg —2T 4 (74.24) 
(a) 
This relation .shows that when the body rotates the mean pressure within it 
is less than in the absence of rotation, as is to be expected. 
Similarly we get the relation 


2 | p* (a, — u)(dx)? = yar — Tot (74.25) 
(a) 
where Net = os { {(%, — ag + (x, — a,)g}(dax)? (74.26) 
or, in the notation of (74.16), 
Nat = (BO, + BO. x) (74.27) 


Equations (74.15) and (74.16) give a representation of B® and B{®, as 
integrals over all space, but they may also be represented as citeorals extended 
only over the volume occupied by the mass (a). To make the transformation 
to this form we introduce a function w, which is defined by the equation 


Y tA é f 
wa =a | o'r —¥ | (de'y (74.28) 
(a) 
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by virtue of which we have 


Awe = Ua (74.29) 
It is then not difficult to prove the equation 
Ou bua (4 0°wWa 
3 
im | Ox Pr or = | P Ox; med eo 
(a) 
whence, by the use of fe “1 we get 
ey 
Bo) — (38 Aw, — = 3 
ta | p (Baw, — 5") da (74.31) 


(a) 
Here the integral extends only over the volume of the mass (a). 
Inserting the integral representation (74.28) for wa into (74. 31) and perform- 
ing the differentiations we are able to write 


Bo —v J | pp Me te =) Caarseaarys (74.32) 


eee Aly 
In analogy with equation (74.31), which may be written as 


ohn (a) 
} : 0x4 OLK oe = §abe — Bie a 
(a) 
one éan also prove the relation 


Owe 
| ° EER (x, — a,)(dx)? = €,,5,, — BY, (74.34) 
(a) 
These relations will also be needed in the following. 

In conclusion we note that as a result of (74.22) and of the equations of 
motion (73.04) the quantity q{?) divided by 4ry may be interpreted, within 
the framework of Newtonian theory, as the stress tensor of the gravitational 
field of the mass (a). 


75. Transformation of the Integral Form of the Equations of Motion 

We take as our starting point the integral form of the equations of motion 
that was given at the beginning of Section 73. We restate these equations 
taking into account that by (73.18) the stresses reduce to an isotropic pressure 
p. We have 


7 1 
-f fevr|a + 5 +a 5 (30? + Til +30)| +5 pu|(dey 
ve eu* 4aU 3 3p 
= one FP ¢3,,2 = YF | (dx)3 
- lea Te e Vf +S ce atte 0) + El tae 
(au, =") 
= So d 75.01 
ie = (i a oy evz(da)$ (75.01) 
(a) 
2qT* 
wi Rue te|e + © (go? + 1 — 0) +4 (75.02) 
ce? at? 
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and AU, = — 4ryer (75.03) 


Putting for brévity 
3 

o=ptSiet+N—U +s (75.04) 

we can write (75.02) as 
RUGS Sates (75.05) 

Neglecting small quantities we get from (75.01) 
d 1 
ai J [onl 1 +5 +5 e+ + 30)| aoe po (dz) 


: 3 aU 4 
= | ae o (dz)? + — =| (SS + ux am) (dx)? —— ; { exes vk —— at (aay 
Ox e c & 
(a) a a 


aan 
We note first that 


5 | 0( 5% Fn) aa ze 7 : | oU (da)? (75.07) 


(a) (a) 


=| 


so that this term may be taken over on to the left-hand side and combined 
with the other terms that are differentiated with respect to time. 

We then consider the first term on the right of (75.06). It is a generalization 
of the expression (71.11) evaluated in Section 71 which referred to the Newton- 
ian approximation. As in (71.10) we split the potential U* into an internal and 
an external part 


U* = uk + U*® (75.08) 
and then have 


eU* du* au*@) 

— o (d: aa.@ 3 3 75 
| ae o (dx)? = { os a (dz) +{ Bi a (dz) (75.09) 
(a) (a) 


(a) 


In Newtonian approximation the first term on the right would vanish by 
(71.19), being the resultant of the internal gravitational forces. But in relativis- 
tic approximation, when Poisson’s equation has to be replaced by (75.05), 


this is no longer so owing to the retardation. We insert into our integral the 
value for o given by the relation 


which holds within the mass (a) and we use the fact that the integral of the 
term involving the ed operator vanishes. This gives 


ai 1 {= duk d?ux* 
Diy 4aryc2 2 
Ps Tye es Ox, Ot 


dz) (75.11) 
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ou* 1 d féu* ut 
oF 4% o dx 3$— a a 3 
© (2) 
0?u* Out 
because a OMG as 
J eT er ae (75.13) 


Owing to the smallness of ia factor outside the integral we can replace the 


quantity uz on the right-hand side by the Newtonian potential ug; equation 
(75.12) then becomes 


1 d f du, du, 
(dx)? = 3 
Nae i 4rcyc® dt J , Oa “at a MoE) 


Following (74.28) we introduce the maces Wg related to ug by 
Awe = Ua (75.15) 


This allows us to transform the integral on the right-hand side of (75.14) and 
to write that equation as 


au* 1d Awa: 
(a) 


(a) 
The last relation could have been derived more directly by inserting the 
approximate solution of (75.10) into the integral on the left-hand side. 
Thus the first term on the right-hand side of (75.06) may be written as 


* %*(a) 
} = = (dxp = [ ee (de +5 | esha? (15.17) 
4 


On; c? dt Ox; Ot 
(a) (a) (a) 
We make yet another transformation. We introduce the function 
W= 27 i e’|r —r’| (dx’) (75.18) 
(2) 
which is a solution of the equation , 
AW =U (75.19) 


U being the Newtonian potential. It is evident from a comparison of (74.25) 
and (75.18) that 
W =D wa (75.20) 
a 


just as the Newtonian potential U is the sum of the potentials of the separate 


masses 
a 


The analogue of the splitting (71.10) of the Newtonian potential into an internal 
and an external part is 


W = wa + WO (75.22) 


Using this, we can replace (75.17) by 
oU* au+@) ldf aewo, | id aw 
— = —-s.= = ad 
i ony Ne | oni ales): c? dt | P 024 at (dz)? + Ties dt ona 
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Repeating the arguments that led us to equation (75.16), now applied to the 
total mass, we obtain the relation 


auU* ld aw 


= : eres ——(dz)? 75.24 
ae TY = aaa)” oa ae) eae) 

(2) (2) 

whence it follows that 
0U*@ ld ow® | 

a dz)?} =0 ° 75.25 
>| Ox; ald) sale an, a1 (79.29) 

a@ @ (a) 


We now consider the last term in (75.06). We divide the “ vector potential ” 
U;, into an internal and an external part 
U, = Uy, + U@ (75.26) 
and conclude as before (see Section 71) that by virtue of Poisson’s equation 
(75.03) we have 


OUak * 
= 75.27 
[ om $2 (ax =o (75.21) 
(a) 
and that; therefore the last term in (75.06) will be equal to’ 
4 zx, , 5 4 ae 
(a) (a) 
We also have 
OU: 1445 
[ em a (dx)* = 0 (75.29) 


We have discussed all the terms on the right-hand side of (75.06). Of these 
terms we transfer (75.07) and the last term on the right of (75.23), both of the 
form of time derivatives, to the left-hand side; then the equations of motion 
(75.06) assume the form 


dP 
a =e (75.30) 
where 
1 1 1 ow 
= — (1 t 
Pu i fen a (dv? + TT + 30)| + Pe 2 eU; ae a ~ a2 
(75.31) 
and 
i au*@ ld ¢ aw@ 4 aU@ 
Fas = | aap 5 | aera Syst k 
= | oe, OP — ae | P ae, a(t — a | ere ade 
(a) (a) (a) (75.32) 
As a consequence of equations (75.25) to (75.29) we have 
> Fa = 0 (75.33) 
a 
and therefore > Pa = P; =-const. (75.34) 
a 


By analogy with Newtonian mechanics the quantity Py; may be interpreted 
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as a component of the momentum of the mass (a). Then the quantity Far 
will be the component ofthe force acting on that mass, Such an interpretation 
is very natural in Newtonian approximation, but here it is somewhat artificial 
because according to (75.31) the momentum P; then depends not only on the 
internal structure of the body and its velocity, but also on the potentials U, 
U;, and W. 

We shall now separate off those terms of Pa; in (75.31) which depend only 
on the internal structure. 

Within the mass (a) the quantities u, and ug; satisfy the equations 


Aug = — 4rrye ; Auat = — 4rryer% (75.35) 
Hence it follows that 


{ eviua(dx)? = J otai(dc)® (75.36) 
(a) (a) 


Using this relation we can express Pa; as 


1 1 e O*Wa 
Par= | eve + our (do® +11 ~ ue) +9 — 3 $) (aay 
(a) 
3 4 1 o2Ww@ 
cs U@ 3 (a) 3 dx) 75.37 
+ al 01,0 (dz) 5 | 004 (dx) ale on a x)? (75.37) 
(a) (a) (a) 


Here the first integral depends only on the motion and the internal structure 
of body (a) whereas the remaining three terms depend also on the potentials 
of the external field, i.e. on the interaction of (a) with the other bodies. We 
shall deal with the evaluation of these integrals in the following section. 


76. Evaluation of the Momentum in Second Approximation 
In the preceding section we reduced the equations of motion to the form 


= Pig (76.01) 


where the momentum P,; and the force Fas were expressed as the integrals 
(75.31) and (75.32). We must now evaluate these integrals and express them 
in terms of the parameters that characterize the motion of the bodies as a whole. 

In order to evaluate Pas we use its representation in the form (75.37). We 
split the momentum into the part intrinsic or “ proper” to the body and the 
part arising from the interaction, writing 


ith Put = (Pat) prop a (Pat)inter (76.02) 
wi 
(Pat) prop = | gui(dey + | vi(}pv® + ell — pua + P)(de) 
2 c 
(a) (a) 1 OW, 
sagt dx) 
a | ° oa, ai z)> (76.03) 
(a) 
and ; oo) 
4 —_—_—(dz)* 
3 2 pUP days 8 ( 
== | prU@ de) | 0U | Bary Ol 
(Pathnter =| i ar “@ (76.04) 


(a) 
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We first calculate the integrals appearing in (76.03). The first gives the momen- 
tum in Newtonian approximation—it is 
| evi(dx)* = Mads (76.05) 
(a) 
in agreement with (71.08). In the evaluation of the second integral we utilize 
the relation 


ell — puta + p= Qa (76.06) 
which follows from (73.26) and (74.02) and also the relation 
fo? = 4a? + do (xz, —a,) + Q, (76.07) 


which is a consequence of the equation 
vy, = d, + oP (x, — a,) (76.08) 
for the velocity within the body (a). The second integral may be seen to be equal to 
{ ov(dv? + Q,)(dx)® = (4M ,a2) 4, + 27,4, + OP a@IOG, + 20PT,; 
(a) 
: (76.09) 


with the notation of (74.03) and (74.05). The last integral in (76.03) may be 
transformed as follows : 


= fe Owe 4 a = | ip tds (76.10) 


With the use of (74.33) and (74.34) we then get 


o2 Pag 
= [eo Sfde? = 6, — BOG, Foley aR OUD 
We collect all ines terms and introduce the notation 
Zip = OT, + &¢)8y + oPolQly — Be (76.12) 


and 
ZO = WOT, + oMe,, — OB, (76.13) 


This gives for the intrinsic part of se momentum 
cg iol. aol : 
(Pat) prop = Ma, + a (4M a?) a, + a (Za, + Z) (76.14) 


Here the first term represents the Newtonian expression for the momentum, 

as we have already noted. The second term gives a correction well-known from 

point mechanics. The last term may be interpreted in terms of the notion of 

a tensor of effective mass, which is the matrix occurring when the components 

of Cah aatay are expressed as linear functions of the components of velocity. 
we put 


: 1 
K = GMa +7.) +5 > [EM d) + 424d, + ZG] (76.18) 
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we evidently have aK 
(Pas )prop = oe (76.16) 
4 


This equation determines the quantity K apart from a function independent 
of the ay. In (76.15) we have included a term >T, in order to ensure that in 
non-relativistic approximation K should go over into the usual expression 
for the kinetic energy of the system of bodies. 

We note that for non-rotating bodies with spherical symmetry we have 


BY = Be a0 44 3 ZR = $e, 5 ZP =0 (76.17) 


Therefore, if we introduce the effective mass 


2 
Ne = M, + 302 Ea (76.18) 


we shall have, apart from small corrections, 


1 

K = ima +5 > Yn.) (76.19) 
a a 

just as for a set of mass points. Equation (76.18) shows that in this case the 

effective mass tensor reduces to a scalar. 

We now go on to the evaluation of that part of the momentum which depends 
on the interaction. We begin by estimating the order of magnitude of this 
part. It is readily seen that all three terms in (76.04) will be of the same order 
of magnitude, namely 


(Pat)inter = o(m 4 (76.20) 


where q is the quantity already used on several occasions which is of the order 
of the velocities in question. When calculating the integrals in (76.04) we retain 
not only the leading terms, which are of the order of magnitude just mentioned, 
but also terms of order Z/R in relation to the main terms; terms of higher order 
in L/R will be discarded. 

To calculate the integrals to the required accuracy it is sufficient to retain 
only the first term in the expression (71.27) for the Newtonian potential, so 
that we take 


1 yMy 
U@(r) = ees (76.21) 
In the expression 
f ,, (dx')? 
U(r) = > x { ¢) U; lr—r] (76.22) 


for the vector potential, on the other hand, we must retain not only the main 
term but one more. Using (71.25) we get 


, M yb U rs) 1 
up = eS etna ety 809 
b 


|r —b| eT Ox, |r —b| 
Finally the function 


rl] WO Tat 
W® (r) = S me | e|r—r | (dx’)8 (76.24) 
p> @) 
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will be equal to 


W@ (r -3> 7M, |r —b] + iat i”? we a a (76.25) 
to the accuracy ee 
Here we have retained one term beyond the first in order to ensure that the 
derivative of W‘® with respect to time may be obtained to the required accuracy. 
Inserting into the first integral of (76.04) the expansion of U(r) into a 
Taylor series about the point ty, = ay, 1.e. 


eU® 
U® (r) = U@a) + (ae — a)( a (76.26) 
Orr / a 
we get 
eUu® 
3 | 0v,U@(r)(da)? = 3M,4,U(a) + 26 T9( = (76.27) 
k/a 


(a) 


Here the expression (76.21) for the Newtonian potential due to the external 
masses must be inserted. We then get 


*3yM aM ya; f 
3 | ,U(r)(dz)* = > yates > 3yM,wP1@ es 
b t7) 


|a —b| K ® Bay ja —b| 
(76.28) 
The second integral in (76.04) may be evaluated similarly. We have 
—4 | eU(r)(dx)? == — 4M ,U (a) (76.29) 
(@) 
and if the expression (76.23) for U\® is inserted : 
a 4yMaM yb; ta) 1 
— 4 f pupptenday = — DB) SEE + Dl oP ay 
ss (76.30) 
Finally, the last integral in (76.04) is 
adit e2w@ 
= —M,|——_— 76.31 
-|e Bx, at | (=), 
Differentiating (76.25) ‘with respect to time we get 
aw 1x alr —b 
r= -3> yM yb, HE? Pl 77 Im * mel (76.82) 


To the accuracy here ee we must ee terms nee third 
derivatives of |r — b|. According to (72.16) the quantity 1® has the value 


1® — w®I® + oT (76.33) 
By differentiating (76. ‘ee we obtain 
2w@ a e|r —b |r —b| 
ee A 
Oar, at 5D ae re a Or +> wy ftdayde 


and therefore 


Ley, lw 2la —b l ; oc ae 
—fe Ox; a = 5D. MM, 12 =? | eo 7M [© *|a | 


Ca; Cay 4 av ak Gay Ca; Cay 
(76.35) 
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According to (76.04) the sum of the expressions (76.28), (76.30) and (76.35); 
divided by c?, gives the quantity (Pot)inter. be 

We now introduce two functions K, and K, such that the first is homo- 
geneous quadratic in the velocities and homogeneous of degree —1 in the 
coordinates, while the second is homogeneous and linear in the velocities and 
homogeneous of degree (—2) in the coordinates. 

We do this by putting 


1 yU.M, 


: (anaes +5 __ as 1 : la —b 
1 42 ra Ja—b]| =>] (3a; + 3b? 84,5,) + Te >-M,M,é,, alta 
(a¥b) ab) ie 
76.36 
and 
1 
K, = — > [M,oP1@(34, — 46,) — M 0136, — 4a,7 2. 1 
2 952 Z VLMa TP (34, ‘) a7 $2 (36, 4a,)] Gay [a —b| 
(a#b) 1 | | 
ala —b 
aoe (a) __ ; 
+ ge 2, AM SL? — 4,19) | (76.31) 
(a#b) a ais: 
It is then easy to verify that 
OK, OK 
Pa eee 
(Patlater = “5g + SH (76.38) 
Remembering the estimates 
ae z I~ML?; ft ~ MgL (76.39) 


it is readily seen that the order of the functions K, and K, is given by 
L 


7 (76.40) 


4 4 
K,~M4; K,~ MC. 
c 
Thus K, will be small in comparigon with Ky. 
Using (76.16) and (76.38) we can write the total momentum in the form 


é 
ay 
where K, K, and K, have the values (76.15), (76.36) and (76.37) respectively. 


77. Evaluation of the Force 
In order to evaluate the integrals which according to (75.32) are involved in 
the expression for the force 


aU*a) 1d a2Wia 4 | 2u@ 
—_— k eae co ae a 3 
Fa | 7a o (dz) oa}? Omar (dx)? — = | pve Ba, (dz) 
i i (77.01) 


we must first of all find the potential U* to sufficient approximation. As regards 
the potentials W and U;, the first order expressions already derived for them, 
(76.32) and (76.25), will be sufficient. 

According to (75.05) the potential U* satisfies the equation 


(a) 


AU* — — = — 4ryo (77.02) 
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where o is given by 
1 
o=p + (dev? + ell — eU + 3p) (77.03) 


and. differs little from p. If we know the Newtonian potential U that satisfies 
the equation 

AU = — 4rye (77.04) 
the generalized Newtonian potential U* may be obtained from U by introducing 
two corrections. The first is for retardation, the second arises from the replace- 
ment of p by c. The latter can be written as 


(o’— 6’) 
Usaa = | 77 (da)? 
re [r —1'| 
As regards the retardation correction, it can be expressed in terms of the 
function W already introduced by (75.18) 


(77.05) 


1 
=i fe lr—r|@ey (7.08) 
(@) 
We then have 
1 ew 
U* = U + oe oe + Uaaa (77.07) 


We now deal with the evaluation of Usag. We write the difference of o and 9 as 


1 1 
5 — p= —(8ov" + ell — eta + 3p) — = pU™ (77.08) 


Here the first term depends.only on the internal structure of the body and on 
its velocity while the second depends also on the external potential. Using the 
equations (76.06) and (76.07) we get 


| (Geo? + eM — pu, + 3p)(dz) = Mad? +E, (77.09) 
(a) 


where ig= | (4pQ, + 2p)(dx)? (77.10) 
(a) 


Also, the first moment becomes 


[ (ov? + ell — pu, + 3p)(x, — a,)(da) = 34,010 + E,, 
(a) 


(77.11) 
where Eat . ) (4pQa + 2p)(ay — az)(dx)? (77.12) 
a) 
The integrals appearing here were evaluated in Section 74 : 
Ea = fea + 87a (77.13) 


and Eat = not + 37 a4 (77.14) 
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Hence we get, approximately 


d 
(Sov? — pI — pua - 3p)’ paar 
(a) |r a a | 


1 
= (3M a2 +8) ay + ORI + 6, ja. RIS) 


“lF al? 
Further we have, to the accuracy required 
'U@(r’ M,U™ 
Ce ee (77.16) 
|r —r’| |r —al 
(a) 


Combining these results we can express the quantity (77.07) 


: 1 
Uaaa = = > (3M a2 — M,U™(a) + &) ]rF—al 
+S (84,001@ + #1 


as 1G = ap (77.17) 
a 
The equation (77.01) for the force acting on mass (a) involves not the whole of 


U* but only that part which is due to the external masses 
This part is 


1 aw 
U*@ — Yo Se oes UM, (77.18) 


whcre U(r) and W')(r) are the quantities already determined in Section 76, 
and the additional term U{), is given by 


1 
@ — TS (3a,b2 — M,Ub) +2 
Ona a - (3420, b (b) Fb] 
b 
rae (3b, + £ WE =e (77.19) 


The values of potential obtained must now be inserted into the expression 
(77.01) for the force. We get 


au@ 
Fux | 2, a + oe (ay + | Ste — ante 
@) 


(a) 
4 aso a ld o2W@ 


DSS dx)3 
"Bay Ot? ue) edt J Gay at oe 
(a) (a) 
4 au) 77 90) 
aoe dx)? (77.20) 
sa | Ue (dz) 
(a) 


Here the first intceral is the Newtonian expression for the force ; it has already 
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been evaluated in Section 71. We have, by (71.33) 
(a) eo 3 
[em ds = =-— (77:21) 


aa 


(a) 


where ® is the Newtonian potential energy of a system of bodies, as in (71.32). 
To the accuracy required the second integral may be written as 


[3 Usa -p (dx)? = Ma (Sa) (77.22) 
) 


0x4 


Here the expression (77.19) can be taken for U@,. The third integral can be 
evaluated by use of the relations (77.08) to (77.14). We obtain 


| te — evaey = 5 (F*) aneaay +8) 
a) 


1 /ayu@ ] oU 
ca @) @) ae (a) 77.23 
s(Secar) eiPlQ +8) Zale") (77.28) 


where the expression (76.21) may be used for U‘@), 

We add the expressions (77.22) and (77.23) and represent the sum as a 
derivative with respect to a;. In (77.23) we may put 24 = ay before differentia- 
ting, but in (77.22) we must take into account that one of the terms in Ub) 
itself depends on a;, namely 

Ma 
a —b| 
where the dots denote terms not containing ay. As a consequence of (77.24) we 
have 


U(b) = Bact (77.24) 


1 -2| b 1 1 yMa 
da; Jab] dal [ab] 3 Ja — 


Using this relation we get 


aU. 34 or 
oe se (dz)? + ee eG — e)(dx)? = s(n + L,) — ax 


V(b) aE wal (77.25) 


(77.26) 
where we have put 
1 
y= 5 D, By MaMy(Gh +B) + GM, +E, Maa 7 
ee b) 
(77.27) 
and 
L : 384M w) 1b, 3M. o@I@g - 
2 D2 > | Ye ask 3j as ¢ 1c Ose I ay. + y(41,6,; — MEN ep =p 


(a¥b) 


(77.28) 
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and where the function ¥ is as, to 
M2M, 


1 
Ve) M,M,U® engees a)(q)24 
or (1 ext ab] = Se in) Tr a 


(77.29) 


with dots again denoting terms independent of a;. We arrange these terms in 
such a way as to make ‘¥ symmetric in all masses and the corresponding position 
vectors. We put 


yy MaMo(Ma + Mr) 


(a,b) = ae Ce (77.30) 


bo 


and 
y? 
VY (a, b,c) = aM MM, 
° : 1 ; 1 i 1 ) 
faba —e] * bee —al * fe—ale | 
(77.31) 


where, temporarily, we have denoted the speed of light by cy, to avoid confusion 
with the suffix in M,. Thus it is readily seen that in the function 


WY=2>V(a,b) +4 > ¥(a,b, c) (77.32) 
a,b a,b,¢ 
(a#b) (a#b, b#e, c#a) 
the'terms depending on a; are the same as in (77.29). Thus in the equation 
(77.26) for the sum of the two integrals ‘’ may be understood to stand for 


(77.32). The terms (a, b, c) give peculiar “ triple interactions ” of the masses. 
We now consider those integrals in (77.20) which involve the function W‘). 


We have, evidently 


3 (a) d a2Wa 
shale Mail 
c Ox, 0t? c? dt Ox; Ot 

(a) (a 
2 4 3] (a 
ab fee ~ 3] a; ai dey (77.33) 
ob 0x4 a | 
(a) (a) 


This expression can be evaluated by use of the relation (76.34) for the 
derivative of W™. We get 


1 33 Va) Fe) Oe) OP 
20 dpe ac (77.34) 
=f ena a ear oa 
(a) | ! 
1 aia —b 
= ; ees) 7.35 
where L,= ie > MoM yajbx Bay Bae (77.35) 
ast = 
1 @la —b 
" (a) _ @y 77.36 
end i= 8c? > va, bl Ma 4 Fag 0a, Oa, ( ) 
a,b 
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It remains to examine the last integral in (77.20). Its evaluation presents no 
difficulties and we get 


4 oU@ ,., eb, , aL, 77.37 
— 5 | om an4 ee a ae eee 
(a) 
2 ~M aM o(axbx) 
where L, SS ) ~ fab] (77.38) 
a,b 
(a#b) 
2 aj — by 
and .=S > 1(M,b,.0@19 — MadpoI®) a bh (77.39) 
(ab) 
We collect all integrals and so get the following expression for the force 
ap at 7) 0 
SE ee Ee —(L+L,4+1L 
Fa = Oa aay Ss eae + L, + L,) + re py + ‘4 + ») 
(77.40) 


Let us compare this expression for the force with the previously derived 
expression (76.41) for the momentum. We put 


—— > yéoMa + EM (77.41) 
(a#b) 
1 ay — by 
and ®,= —55 > veyMa — ‘aM bh (77.42) 
(ab) 


The negative of these quantities also enters the expressions (77.27) and (77 .28) 
for Z, and Z,. 


It is simple to check that we have 


[L,+L1,+1,=K,—9, (77.43) 
and similarly 
L,+l,+1,=K,—9, (77.44) 
where K, and K, have the values (76.36) and (76.37). Therefore 
a 
Pa = 57 (Ky + K,—® —®, —®, —¥) (77.45) 


On the other hand, by (76.41) 


a 
Pa = aqj(K + Ki + K,) (77.46) 


where K is given by (76.15). 


Here K is independent of the coordinates a; while the quantities ®, ®,, 
®, and are independent of the velocities d,. Putting 


L=K+K,+K,-®—0,-0,_¥ (77.47) 
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we can therefore write 


: aL aL 
P => ——* —— nr 
ua Fane (77.48) 
The equations of motion 
aP ai 
“a a Fu (77.49) 


then appear in the Lagrangian form 


a (77.50) 


78. The Equations of Translational Motion in Lagrangian Form 


In the preceding sections we have derived the equations of motion for the 
mass centres of a set of masses, by use of the condition 


| gV,T#(dz) = 0 (78.01) 
(a) , 
which in turn was derived from the harmonic condition (see Section 70). It 
was assumed in the derivation that each mass rotates about its own mass centre 
in the manner of a rigid body. 
As we saw, the equations of motion are expressible in the Lagrangian form 


——- ——__=(90 78.02 
dt Od, Oat ( ) 


where the Lagrangian is obtained by inserting into (77.47) the expressions 
previously found for K, K,, K., ®, ®,, ®, and Y. (See equations (76.15), 
(76.36), (76.37), (71.32), (77.41), (77.42) and (77.32).)+ We restate these 
equations. We have 


L=K+K,+K,—®-—0,—0,-—¥ (78.03) 


where the various terms have the values set out in the following : 


1 at : 
=D GMa +7.) +3 D, (Mala) + 12Pad, + 294) 
a a 
(78.04) 


Here the first sum represents the usual Newtonian kinetic energy of the 
translational and rotational motion of a system of bodies. (In forming the 
left-hand side of Lagrange’s equations it is inessential to include the term 
Ty.) The second sum gives a correction to the “ kinetic” part of the Lagran- 
gian, i.e. to that part which depends on the velocities. In the absence of rotation 
this correction reduces, according to (76.19), to the usual correction term well- 
known from the mechanics of mass points ; when rotation is present this cor- 

+ For non-rotating masses the reduction to Lagrangian form was first performed by 
T. Fichtenholz [40]. 
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rection also involves ‘‘mixed ” terms which depend both on the velocities of 
translational motion and on the angular velocities of rotation of the body. 
The quantity K does not depend on the position of the mass centre of the body. 

We now state the values of K, and K,. According to (76.36) and (76.37) we 


have: 


| yMaM»,. : 1 : a —b] 
Ka = 75D, ja—o] + 30} — 84) + 7 > YM Motibe Far 
a 
a,b (78.05) 
and 
1 ()70)/(3h. — 4g MwOI@(3a, — 4b 2 = 
Ky = 35 > (yM wo Ig (3b, — oO — Mog Lap (3a, — £0, |a —b|? 
a,b 
1 &|a —b| 

ane (a) _ 1,10) .-—1 78.06 
+. Be2 >» (yM, 6,14 yM,4,1¢)) da, a, Ga, ( ) 


a,b 


These terms depend both on the coordinates and on the velocities and repre- 
sent, so to speak, the result of an interaction of kinetic and potential energies. 
As we already noted in Section 76 the quantity K, is homogeneous and quadratic 
in the velocities of translational motion a; whereas K, is bilinear in the velocities 
of translation and the angular velocities. 

The next term in the Lagrangian is : 


e 1 
Cay Cay |a = b| 
(78.07) 
It represents the Newtonian potential energy of the system of bodies. It is 
augmented by the corrections ®, and ®, which, by (77.41) and (77.42) are 


Eyal ry — 
22.Ja—b] 142," ner el 


1 . 1 
— 81 55 2, role + Eola) (78.08) 
1 aj — b; 
and —0,= gD, VEwMe — Eas hi (78.09) 


The quantity ®, may be interpreted. as the result of replacing the mass M, in 
the Newtonian potential energy, or more precisely in its leading term} 


®=-}> ae (78.10) 


a,b 


by the effective mass M, + 8M,, where 
1 
sM,= a Ea (78.11) 


On the other hand, the quantity ®, may be interpreted as the result of dis- 


{ In the equation (78.07) for ® the correction term is of order L?/R? relative to the main 
term. Consequently relativistic corrections to the correction term will be of the same order 
as terms already neglected. Therefore it is immaterial whether such corrections are intro- 
duced into the complete expression for ® or only into the main term ®,. 
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placing the mass centre of body (a), so that a, goes over into a; + daz, where 
1 

7M a 

Indeed we have, apart from small quantities, 


- S (Ma + 3Ma)(My + 8M») 
“2, |a+3a —b — db] 


3a; => 


Eat (78.12) 


=0,+0,+9, (78.13) 


We note that the quantity & involved in the equation for the effective mass 
is equal to the intrinsic energy of body (a), which is composed of the kinetic 
energy of its rotation around its mass centre, of its elastic energy and of the 
gravitational energy of its component particles. Indeed it can be shown that 
the expression 


= 728 | ell (dx)? — 4 | otta(dax)® (78.14) 
(a) (a) 


for the intrinsic energy goes over into the form (77.10) for Eq if Liapunov’s 
equation and (73.26) are used. 

It might be expected that the presence of the mass related to the intrinsic 
energy of a body would give rise not only to a change of the effective mass 
but also to a displacement of its mass centre. However, in view of the fact 
that the kinetic energy of rotation about its mass centre refers to the body as 
a whole and is not related to the individual constituent particles, it would be 
difficult to predict the correct expression for the displacement of the mass 
centre. Actually the calculation has proved it to be given by (78.12) with 
Eat given by (77.12). 

It remains for us to write down the last term in the Lagrangian. It is con- 
venient to denote the three position vectors occurring in it by a, b and c and 
we therefore adopt the notation cy for the speed of light, as was done in Section 
71. According to (77.30), (77.31) and (77.32) we have 


y? ~< MaMo(Ma+ Mo) y* 
i? ne a 


eee eee : a 78.15) 
: (cine + pep aay *femayje—oy) 

Here the last sum gives the triple interaction of the masses. { If in the aces 
of a remark made above we call the terms K, and K, the “ kinetic-potential 
part of the Lagrangian the quantity ‘* may be called its “ potential-potential 
part. These names stress the fact that in the second approximation of Einstein’s 
theory the additivity of kinetic and potential parts of the Lagrangian, character- 


istic of Newtonian theory, no Jonger prevails. 
According to (78.03) the total Lagrangian is the sum of the expression (78.04) 


+ In the triple sum the labels satisfy the relations a = b,b #¢, ¢ 4, and in all double 
sums we must take a 4 b 
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to (78.09) and (78.15). To the present degree of approximation it leads to the 
equations of translational motion with relativistic corrections. 

As for the equations of rotational motion, they may be obtained from the 
relations (70.28) which have the form 


ioe | g(uiV,T% — 24V,T%)(dx)? = 0 (78.16) 
(a) 


In Section 72 it was shown that in Newtonian approximation these equations 
determine the rate of change of the angular momentum 


M® = WPI — wPI@ (78.17) 
of each body, giving 


+ 3vyiM = 
d M® = > 3yM o(a; ba an — be) — (a; —b)I@] (78.18) 


dt . |ja—b|>~ 


(see equations (72.07) and (72.13)). 

The relativistic corrections to the Newtonian equations of rotational motion 
may be obtained by a more exact evaluation of the left-hand side of (78.16). 
These calculations can be performed on the pattern of those performed for 
translational motion in the preceding sections. They do not present any 
difficulties apart from the complexity of the manipulations involved. However, 
since relativistic corrections to the rotational motion of celestial bodies are 
quite insignificant and are even harder to observe than the corrections to their 
translational motion, we shall not evaluate them here. 

79. The Integrals of the Equations of Motion for Systems of Bodies 

Just as was the case in Sections 26 to 28 for a system of particles interacting 
by means of the electromagnetic field, a system of gravitating bodies has the 
property that its equations of motion possess the ten classical integrals, or 
constants of the motion, namely the integrals of momentum and energy and 
those of angular moinentum and of mass centre motion. 

We shall now derive general expressions for these constants of motion in the 
form of definite integrals. In doing this we shall proceed as in Section 71 and 
make use, for the internal motion, of the general equations of motion of an 
elastic body in non-relativistic approximation. We restate these equations. 
We have the equation of continuity 


0p , Apr) 2 
a i a 0 (79.01) 
the equations of motion proper 
- oU _ Opir 5 
pus Pon mo (79.02) 
in which the acceleration w; is given by 
ee Ov; 4 Ov; = 
i= at OF oe (79.03) 
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and finally the relation giving the elastic potential energy : 


dl Ou dv 
a= pn ( ot a ) 


Dey on (79.04) 


We shall not at present introduce the assumption that each body rotates in a 
rigid manner. 


To obtain the constants of motion we may start from the relations which, 
according te Section 70, result from the harmonic conditions. The relations 


= | 9V.2e4(dz)° =0 (79.05) 
(%) 
integrated with respect to time give us the three momentum integrals, and the 
relation 
= | gV,T(dx)> = 0 (79.06) 
(2) 


“integrated with respect to time gives the energy integral. Similarly the relations 


= | 9(x,V,T% — x, V,T*)(da)? = 0 (79.07) 
(~) 
will give us the three integrals of angular momentum and finally the relation 


= { gu,V,T2(dx)? + a5 { gV_,.T%(dx)® = 0 (79.08) 
(x) (2) 
(in which, incidentally, the second term vanishes by virtue of (79.05)) leads 
to the three integrals for the motion of the mass centre. 

We begin with the evaluation of the momentum. In deriving the conserva- 
tion law from (79.05) we can use the results of Section 75. Generalizing equation 
(75.31) a little we put 
1 aw 
peered 6 
oe Ox "| ( a 

(79.09) 


and then form the derivative of this quantity with respect to time. Using 
(73.01) we first find 


1 1 4 
5 { {em “fe nee ++ 30)| = pep iktk = “ae: — 


(20) 


7 4 ou; 
ee ae (an)? = i a ° oN dz)? — =| ov, — (dz)? 
dt Oz; c? dt Ox, Ot c Ox; 
(79.10) 
Pkk 
where c= + F(got +11 —U) ers (79.11) 
and the function U* satisfies the equation 
1 e2U0* 


But since all integrals in (79.10) are to be taken over the whole of space, relations 
(75.24) and (75.29) may be applied to them, according to which 
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_! d ow 

——— dz)? 79.13 
[ Fo ayr~ 55) ea (79.13) 

oU 
and | si ae SO (79.14) 

OX4 

Hence it follows that 

ai ; Py = const. (79.15) 


so that P; is a constant of the motion. This result represents a generalization 
of the relation (75.34) for the general case of a system of elastic bodies inter- 
acting by means of the gravitational field. 

We now go on to the formulation of the conservation law for the angular 
momentum of such a system. Introducing the expression (70.22) for gV,7% 
into (79.07) and using the relation (70.26) for the components of the mass 
tensor we get 


5S (le = 5 + II + 30)| (xiv, — LR) — “nee = pre) (de) 
= -| (« au — nls (dx)® + — ale (2. ve) a) 


U; au : 
| male —aS)ide)* (79.16 


where for brevity we have put 
dU, av; au; 


a at Gay 
If in (79.16) we extend all integrals over the region occupied by one of the masses 
we obtain a generalization of the relation (72.04) that gives the law of the change 
of angular momentum in Newtonian approximation. We are here interested 
in the total angular momentum so that the integrals must be extended over 
the whole of space. 
We introduce the notation 


(79.17) 


G= ee 1 mn 1 Ww 

i le + ee? +H + 30)| v4 Puts — “201 2° Ba, at 
(79.18) 

using which, we can write equation (79.09) for the total momentum as 
Pi= | G,(dz)? (79.19) 


We can then represent (79.16) in the form 


(uiGy —= xxG's)(dx)® 


oU* ) ld ow ow 
= %4——__ — &_———_ dx)? — — — 2 ep 
{ (2.5 ‘ : 0x4 aAee) c? dt { o( a Oxy, Ot TF or; =) (ae)? 


OU; au; 4 
= oe =| gu eo — Uk £) (an) = 7) [eto — v~U4)(dx)8 
(79.20) 


d= 
dt 
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We prove that the first two integrals on the right cancel and the remaining 
two are separately zero, so that the entire nght-hand side vanishes. 

In evaluating these integrals we shall use the following lemma: 

If the function ) may be expressed in terms of the density p as 


p= f f (|r — 4] u(r’)(de') (79.21) 
where f is some function of the distance |r — r’|, then the relation 
a af 


is true. It is assumed here nee in cae (79.21) one may differentiate under 
the integral sign. To prove this it is sufficient to insert the values of the deriva- 
tives of }, as calculated from (79.21), into (79.22) and to note that in the 
resulting double integral the mtegrand is antisymmetric in the coordinates of 
the two points r and r’. 
We have already encountered a similar relation when deriving (71.14). 
Remembering equation (77.07) we may write 


1 aw 
U*=U,+- ae (79.23) 
where U, = U + Uaaa is a solution of the equation 
AU, = — 4nyo (79.24) 


The second term in (79.23), in which W denotes the quantity 


W =}hy i e’|r —r'|(de’) (79.25) 
(0) 
is a retardation correction. 
We consider the first integral on the right-hand side of (79.20). By the lemma 


just proved 
i (2 We ‘)o (dz) = (79.26) 
CLk Out 


Therefore we have 


eu* a 1 ow aw Ne 
| (« a =) od ate | (x Ott On,” at? Oa) ® a) 


(79.27) 
On the other hand, for 
j dp | ow 
the lemma gives the relation 
aw a now) e Bc 
ae (79,29) 
| (x aa ae) 


Using this we get 


d ew aW\ ( ow ow am) oa 
di (x at Om,” a ma) = | elt saa, BP oe) 
(79.30) 
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and therefore 


oU* _—au* ld aw an) ; 
[ao am) = rad oo sa —*¥ oan) 


(79.31) 


so that the first two integrals on the right of (79.20) do indeed cancel. 
The third integral on the right of (79.20) vanishes as a result of the same 
lemma ; one has simply to apply it to 


f= 


Finally, the last integral vanishes because Poisson’s equation holds for Uy. 
Thus the entire right-hand side of (79.20) is zero and we have 


—/! _ . = 79.32 
rae p=ey; v=; ( ) 


_ i (iG — xxG)(dx)? = 0 (79.33) 
(x) 
If we introduce the total angular momentum of the system, 
Mu = { (1iGu — 2xG1)(dax)? (79.34) 
(20) 
we have 
Mix = const. (79.35) 


We recall that in these equations the quantity G, has the value (79.18) and 
differs little from the momentum density pvy. 

We now go on to the formulation of the law of energy conservation for a 
system of bodies. In contrast to the other conservation laws, the law of energy 
conservation can be derived from (79.06) only in Newtonian approximation, 
the accuracy of calculations being the same as for other laws. A more exact 
form of the energy conservation law can be obtained by discussing the Lagran- 
gian form of the equations for the system of bodies. For non-rotating masses 
this will be done in the next section. 

Since the main terms p and pv; in the components c?Z and c?T satisfy 
the equation of continuity (79.01) the relation (79.06) amounts to the same as 


c J V_T°(dz)? = 0 (79.36) 


to the order of approximation considered. Introducing the expression (65.23) 
for the divergence into the last equation we find 


d aU 
Zi | e270 (dx)3 + | ap T (dx)? = 0 (79.37) 
and, using the value (70.26) for T® we-get 
d 1 1 aU 
af eb + aGo + 0 — lay + 3 [e@mr=o (79.38) 
Since separately 


Me = i o(dr)® = const. (79.39) 
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we must have 


d 
| e(30? + IT — U\(days +] o% (dz)3 = 0 (79.40) 


To obtain from this a conservation law in the narrower sense we must trans- 
form the second integral in (79.40) into the form of a time derivative. This is 
easily done with the aid of the relation 


| (< = 0) any =0 (79.41) 


which follows from Poisson’s equation for U. Subtracting half of (79.41) from 
(79.40) and putting 


E= { o(fv? + II — 4U)(dz)3 (79.42) 
we get 
= =); E = const. (79.43) 


The integrals of the separate terms in (79.42) represent the kinetic. elastic 
and gravitational energies of the system of bodies in Newtonian approxi:nation. 
If we do not separate off the terms (79.39) from (79.38) and put 


M= | o( sf (30 41 10)\(azy (79.44) 


we also get 
M = const. (79.45) 


The quantity M is the total mass of the system of bodies. It is equal to 


E 
M=M°+—5 (79.46) 


where M° and E£ are defined by (79.39) and (79.42). 

It remains for us to examine the integrals of the mass centre motion of the 
system of bodies. They may be obtained starting from the relation (79.08). 
This amounts to transforming the expression 


4 f aoft + aut +11 — 1)](@o) 


= { eo + 5" + — _ 


— U) 


1 
+ Af aepopuslda)? + 55 [xe ae cos 
= 


with the aid of the equations (79.01) to (79.04). Inserting the value for pw; 
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given by (79.02) into (79.47) we get as the sum of the last two integrals 


1 dil dU 1 1 eu 
a { nioluy ae 5 |e as 5 | Preedey = al xp = (da)? 
(79.48) 


Using the notation (79.18) we can write the result of inserting (79.48) into 
(79.47) as 


: { nel + = (10" +1 — U)\(aoy 
a i G, (dx)? — a| ev (da) + 5 i eU; (da)? 


1 aU 
— — (dx) 79.49 
a a] ears ° Bary at a “P — 5 | 2 ot (@2) ( ) 
Adding to this the obvious equation 


1d 1 aU. a 
cyan iso ras as 3 
5a aipU (dz) To ne a +U =) (ae (79.50) 


we can replace oe by 


5 { 20(1 +5 + (07 +1 — 10)|(azy 
= | ex (dey —5 f owt (day +5 [ote (aay 


lf ew 1 au a) 
— = SiS eer 3 
od rer (day? — 55 | nes U — | (dz) (79.51) 


This equation is valid if the region of integration includes one or some of the 
masses, but if the integration is extended over all space it can be simplified. 
By using the Poisson equations satisfied by U and U; we get 


| ouU (dz)? = { oUs (dx) (79.52) 
We also have 
aw aU Je) 
3—1 eet Se Py ae 3 
pg doy = 4 { mo? — 02) cao (19.58) 
The latter relation can be most simply verified by noting that the function 
aw 
= UU — ac = (79.54) 
x4 


satisfies a Poisson equation of the form 
AQ: =.= 4mypay (79.55) 
As a result of (79.52) and (79.53) all integrals on the right-hand side of the 
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equation, except the first, cancel and we get 
d Le 
/ a| (1 + rie + ~— 1U)}(aa® = | Gi(dzx)® =P; (79.56) 


where, by (79.15), P; is constant. Therefore equation (79.56) may be written 
im an integrated form, namely as 


1 
| x1 + 5(Qe? + — 40)\(azy 9p (79.57) 


where K; is a new constant. Remembering that the total mass M as defined 
by (79.44) is constant we can introduce three quantities X; by means of the 
relations 


MX; = | we eu (40? +1 — 40)\(azy (79.58) 


These quantities can be interpreted as the coordinates of the mass centre of 
the system of masses, and equation (79.57), written as 


MX, —(tP,= K, (79.59) 


can be interpreted as the law of motion for the mass centre. Equation (79.08), 
which was our starting point, may be considered to be the result of differentiating 
(79.59) with respect to time. 


80. Additional Remarks on the Problem of the Motion of a System of 
Bodies. The Explicit Form of the Integrals of Motion for the Case 
of Non-Rotating Masses 

In the preceding section we introduced the integrals of motion for a system 
of bodies,. assuming the validity within each body of the non-relativistic 
equations of a motion for a continuous medium (79.01) to (79.04). The question 
arises whether the integrals of motion so found are still constant if the equations 
of motion of a continuous medium are taken to be only approximately valid 
within the bodies, as was assumed in Section 73, and if instead it is only postu- 
lated that the equations of motion of the bodies as a whole are satisfied. In 
this case one has, of course, to reintroduce the assumption that the bodies 
rotate as if nigid. 

As regards the momentum, the question just posed is very simply answered. 

The equations of motion obtained in Sections 75 and 77 amounted precisely 

to the statement that the quantities 


Pat = i Gi(da)? (80.01) 
satisfy the relation (a) 
ey (80.02) 
where, by (75.33) di 
2, Fai = (80.03) 


Therefore the fact that the quantities 
Py = J Gi(dx) (80.04) 
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are constant, is a consequence of the equations of motion of the bodies as a 


whole. 
The same applies to the quantities 


Mu = | (1iGe — cxG,)(dx) (80.05) 


which represent the angular momentum of the system. In Section 72 we wrote 
down the equations of rotational motion only in Newtonian approximation. 
But the more precise equations for the sum of orbital and intrinsic angular 
momentum are precisely of the form 


£ | (2,4, — 2,G,)(dz> = Li (80.06) 


where L@) is the sum of the same integrals that appear on the right-hand 
side of (79.20), except that here the integrals are extended over the region of 
mass (a) and not over the whole of space. Now, as was proved in Section 79, 
the relation 
YL@ =0 (80.07) 
a 
is valid and hence the constancy of the My is a consequence of the rotational 
equations (80.06). 
We now verify that the relation 


{ gV,T*(da)? = 0 (80.08) 
(a) 
is satisfied. It must hold in order that the corresponding harmonic condition 
be satisfied. On the other hand, if it does hold, so will the relation (79.06) 
which is connected with the energy integral. 
The transformations that took us from (79.06) to (79.38) are also applicable 
in the case when all integrals are extended over the volume of a single mass. 
Therefore we may write immediately 


d 


5 | of} + =(3 du? + IT — U)|(aey — | ea (dz)®=0 (80.09) 


(a) 


and since, separately 


{ o(dx)? = Ma = const. (80.10) 
(a) 
we must have 


d 
x | o(g0* + TT — U) (dey + [es (dz)? = 0 (80.11) 
(a) (a) 


This relation is fulfilled as a result of the equations of motion of a continuous 
medium (79.01) to (79.04). We must show that they will also be approximately 
satisfied if the equations of motion of a continuous medium are satisfied only 
in the mean and the body is taken to be rotating as if rigid, but not necessarily 
with constant angular velocity (see section 73). In this case our problem 
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reduces to verifying the relation 


au ; 
avi | wy — n (dx)? = 0 (80.12) 

(a) 
which follows from (80.11) by differentiation when account is taken of the fact 
that the elastic energy remains constant. Here w; is the acceleration for which 
the expression (73.07) was deduced in Section 73 ; the integrand in (80.12) is 
VE times the left-hand side of (7 3.08). The equation (80.12) is readily verifiable 
with the use of the relation 


| ena — (—) |e — aj)\(dx)? = 
Fa le a rs aj)(dx)? = 0 (80.13) 
which may be written as 

dT a al 165 4 

dt 2 \ day éx;/q oe) 


This relation coincides with (7 2.32) and is satisfied by virtue of the equation 
of the rotational motion of the body. The verification of (80.11) may also be 
performed as follows. Dividing the potential U into an internal and an external 
part and using equations (71.19) and (72.09) which express the fact that the 
resultant of the internal forces and of their moments is zero, we may write 


Oua Ue 
| ay = ge ad ia (80.15) 
Further it is evident that 
d 
di e(Il — ua)(dx)? = 0 (80.16) 


(a) 
Therefore (80.11) reduces to an equation involving only the extcrnal potential 


U‘), namely 


| p(go® — Utan(de)® + | p= (da)? = 0 (80.17) 


(a) (a) 


d 
dt 


Inserting the value (71.27) for U'™(r) into this equation and using the equations 
(71.35) and (72.32) for the translational and rotational motions we see that 
(80.17) is satisfied. 

Thus the validity of the relation (80.08) which must be required to ensure 
that the harmonic condition is satisfied is, in fact, a consequence of the remaining 
relations of the form (70.21), just as in point mechanics the equation expressing 
the conservation of energy is a consequence of the equations of motion. 

Similarly it can be verified that the equations of translational and of rotational 
motion ensure the validity of the relation (79.51) and hence of the law of motion 
of the mass centre. 

As we have already noted in Section 79 the accuracy of our approximation 
permits a derivation of the law of conservation of energy from (79.06) only in 
Newtonian approximation. However, for non-rotating bodies with spherical 
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symmetry it is not difficult to write down the energy integral in the next 
approximation also. In this case the motion will be purely translational, and 
for such motion we have derived the Lagrangian form of its equations. There- 
fore we can base the derivation of the energy integral on the Lagrangian 
examined in Section 78. We introduce the effective mass 


a 


ma = Mat (80.18) 
and then have, by (78.03) 
PSR =the (80.19) 
where by (76.19) 
1 
K => 3m ai + 5 > dm (4)? (80.20) 
a a 


As before, the quantity K, is given by the equation (78.05) in which we can 
replace M, and M, by the effective masses mg and m,: 


1 YMNaMo . . 1 i ae 
K,=— Dia (342 + 362 — 84,b,) + Te 2 ym, a,b, Qa, day |a— b| 


4c? Ja —b] b| 
(80.21) 
® is the Newtonian potential energy for the effective masses. 
YMamMo 
-5 » ja=b] (80.22) 


In view of (78.13) this expression senides the terms ®,, and ®, (incidentally 
®, = 0 because of the spherical symmetry). Finally, ‘¥ has its previous value 
(78.15), but in the approximation considered the masses involved may also be 
replaced by effective masses : 


_ Vy mamo(ma + me), ¥* 
482, fab? 62 >, mam 
a,b 9 a,b,c 
y ( 1 ai 1 1 
Jab] a=] * be bal * fe—al fe—b] 
(80.23) 
Using the relation 


aL 
> GL (80.24) 
7 Ody 


we obtain the energy integral (with the non-relativistic choice of the zero of 
energy) in the form 


1 
BS > ima +5 > 8m,(a2)2 + K,+0+¥ (80.25) 


a a 
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The remaining integrals of the motion can also be readily derived from the 
Lagrangian. We obtain the momentum integrals 


> Pat = Py (80.26) 
a 


where 


oL 
= ease YMaMd 
Pap = = Gy(mg +5qm at, — aD. = 


7 , YMamr( da = by) 


as = (as — bi)(ax — bx) 
+o ~ Ja —B| sas >, mana ~ fa—bil> 
(80.27) 
the angular momentum integrals 
>, (Pax — axPast) = Mix (80.28) 
a 
and the integrals of the mass centre motionf 
MX; ~— . t = K; = const. (80.29) 
h S YMaMp ee) 
where MX, 2 — +5 maz — a la =] (80.30) 
1 YMaM» 
and where M= D (m, +55 ms oe 2, jab] (80.31) 


is the total mass of the seal 
A more exact value of the total mass is 


E 
M= > ma += (80.32) 
a 
where & has the value (80.25). 


81. The Problem of Two Bodies of Finite Mass 

In the case of two bodies the equations of motion obtainable from the 
Lagrangian (80.19) can be integrated. In this connection it is convenient to 
change to a notation which is more usual in mechanics. To label the two masses 
we shall now use the numbers 1 and 2 instead of the letters a and 6, and we 
shall denote the coordinates of the two particles by 1, y,, z,, and 2%», Ys, 
2, respectively instead of using a and 6 here also. 

We shall also use the three-dimensional vector symbols r, and r, to describe 
the positions of the two masses. In the new notation the Lagrangian becomes 


Me). 
L=5mit +5 5 Mate + ar = (ER +e (EY 
‘i fae 
202 |r, —r.| 
1 MyM s : 
ay eras. —F2)}{(To° (Ty —T,)} 


YN MN» 2 mym,(m, + M2) (81.01) 


2c? |r, —r.|? 


se (3i2 ++ 3i2 — TE, -F,) 


|r) —Ts 
ft The constants A, should not be confused with the quantities A, and A, appearing in the 


Lagrangian. 
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and the energy integral appears as 


1. 1. 3m,,. 3M. 
E= 3 fit 5 Make aati a Be, #2) 
+ TMM (392 4 382 — THF) 


2c? |r, —r,| 


1 ymym, 


— Boi Tey agp Os Teele — T2)} 


ys MMS. Se Maal Me) (81.02) 
|r: —r,| 2c? |r, —r,|? 


The equation (80.27) for the momentum of one of the masses should also be 
restated in the new notation. The component P®, for instance, is given by 


F 1 : 1 ymym, 
a,(m, + oat — 53 i —t] 


T ymym(%,—Z2)  ymym(x, — Zp) 


Sf ae ee 28) te L03) 


The momentum of the other mass can be obtained from the last expression by 
interchanging the suffixes 1 and 2, and the total momentum will be given by 
P, = P® + P®, ete. (81.04) 


Further, we state the equation for one of the components of angular momentum, 
for instance 


Mog =e PP 9, PO Fae 97 (81.05) 
We have 
M : : 1 19 | 3 YMyM, 
ay = (iY, — YA), + rae a oe |ri —F.] 
. : 1 . 3° symm 
+ (XpH.— Yat) (m, ae aoa Met ee oe woe) 


TymyMe cots : S ; 
2c?| ry = r,| (Zo — Yt, + Leif. — Yok) 


YMNyMs 


a {fl + £2) + (ti — Te) }(tiY2 — Tay) (81.06) 


2c?|r, —r| 
If we introduce a system of coordinates fixed to the mass centre we can write 
the mass centre integral as 

Bt eo eo) 
2c? 2c? |r, — r,| 


mari(1 + 
(81.07) 
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We now introduce the coordinates of the mass centre in the Newtonian sense 


_ M40, + mr, 


r 
= ag (81.08) 
and also the relative coordinates 
r=r,—r, (81.09) 
Then we have 
ry=ryo+ —r 
0 
ce (81.10) 
r=) ——1T 
Mg 
where 
Mo = m4, + Mo (81.11) 
is the total mass. Further, we introduce the notation 
* _— MyMo 
m ete oa rea (81.12) 
for the reduced mass and note that 
mym* = MyM, (81.13) 
We also have 
m,t? + m,r2 = mt? + m*i* (81.14) 
my(244 — Yr41) + Mo(Loif2 — Yoke) = Mo(LoYo — Yoto) + m* (xy — yf) 
(81.15) 
and LoYy — LyYo = Loy — LY (81.16) 


Equation (81.07) shows directly that the radius vector of the Newtonian mass 
centre will always remain small and its rate of change will also be small. If 
we take the orders of r and r to be FR and q respectively, we have 

q?. . @£ 


a ters (81.17) 


r~R 
0 C2 


Therefore in all correction terms that contain the factor 1/c? we may replace 
(81.10) by 


pe ae ee (81.18) 
Mo Mo 
m 
and i= t; tp = ——t (81.19) 
Mo Mo 


Making these approximations in equation (81.07) itself we can write it in the 
formt 


= * 
fires es rn — 9) mat = "| (81.20) 
0 


Hence it can be concluded that in the case of a bounded motion the Newtonian 


¢ This form was first derived in the author’s paper of 1941 [38]. 
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mass centre ry will oscillate about its mean position. 

We rewrite the integrals of energy and angular momentum using the assump- 
tion that r, is small. We can neglect the quantities ry and ry even in the main 
(Newtonian) terms of (81.02) and (81.06) since according to (81.14) and (81.15) 
they appear quadratically in these; this means that the values (81.18) and 
(81.19) may beused everywhere. Introducing these values into the energy integral 
(81.02) and dividing the result by the reduced mass m* we find for the quantity 


E 
By=— (81.21) 


the expression 


1 
Ey = 5t* i 
1 (3 3m*\ . : : 2m? 

oe alg (1 = ye eo x (Big 4 om* Ee = (r-t)2 4 | 
(81.22) 

We now introduce the values of r, and r, into the angular momentum integral. 

By (81.16) and (81.20) the last term of (81.06) is equal to zero in high approx- 

mation and the remaining terms give 


1 1 3m*\ , Y ; : 
x ay = (1 + wal! — mi + aay (BM + mag — yz) (81.23) 


Similar expressions result for the other components of angular momentum. 
It is evident from these expressions that in the space of the relative coordinates 
z, y, 2 the orbit will be plane. Taking the plane of the orbit to be the zy plane 
we can put in the usual way 


z2=0; Z=0 (81.24) 
and we can introduce the polar coordinates r and @ by the equations 
Z=PTcosg; y=rsing (81.25) 


Denoting the constant on the left-hand side of (81.23) by » we get 
aK 1 om*)\ | ? Y Pee 
BL => + De2 1 = My r + or (37% + m ) Tr i) (81.26) 


Before continuing we compare the integrals of energy and angular momentum 
just derived with those obtained in Section 58 in our investigation of the 
motion of an infinitesimal mass in the field of a large or finite mass. We must 
expect that for m* = 0 the equations of the two-body problem go over into the 
equations of the one-body problem. Let us verify this. As the equations (58.26) 
to (58.31) show, the following relations held in Section 58 : 


r—a dt ae E, 
PSS ker Papua 1 oe (81.27) 
dp 
and (r+ «) =p (81.28) 


where (5)'- - ~ = — =| (4) (“) GA (2) | (81.29) 
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and where « is the gravitational radius of the large mass : it may be taken to be 


YIN 
=r (81.30) 
Putting 
: dr\2 d\ 2 
v= fr? — (5) + (2) (81.31) 
we may replace (81.29) approximately by 
(5) -¢ [1 1 , 4a\ , 4 
7) ae recr| ae ( + she (81.32) 
whence 
dl rtoa I 4a 3 u4 
a em Be) Ss ee ee Sly ae 
dt (: _ *) | a 2c? (1 r )e a 8 =] ete) 
and then 
r—adl a ve 3a 3ut a? 
a — — _— — ——— 2 _ —+ 
rtadt r Ss 2c? 3 Der ” a3 Bet — 2r? ele) 


Introducing this expression into (81.27) and replacing « by its value (81.30) 
we obtain for E, the value 


Qa 2 
Bea oe Y =) (81.35) 


which agrees with that given in (81.22) if in the latter m* is equated to zero. 
In order to compare the equations (81.26) and (81.28) we separate off the 
factor rp in the latter and write 


a\?dt do 
Le} ee = 81.36 
( , *) dt : dt ( ) 
The value of dt/dt may be taken from equation (81.33) in which terms of order 
v‘/ct may be neglected. Then we get 
a\ 2 dt 3a UP 
a (pai es dae 81.37 
(1 " *) dt er r 7 2c? ( ) 
Here we replace the gravitational radius « by its value (81.30) and insert 
(81.37) into (81.36), so getting 


(1 Pe st = (81.38) 


If m* is equated to zero in (81.26) that equation agrees with (81.38). 

Thus in the limit the integrals of motion in the two-body problem do indeed 
go over into the corresponding integrals of the equations of the geodesic which 
determine the motion of an infinitesimal mass in the field of a finite mass. We 
note that this comparison has been possible only because we have treated both 
problems in terms of the same, namely harmonic, coordinates. 

We now continue the discussion of the equations for two finite masses and 
derive the equations of the trajectory of their relative motion. We shall retain 
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the symbol v? in place of £? to denote the square of the velocity. If we put also 


uU=- (81.39) 
r 
we have the identity 
dp\? 2) 2 
2— [r?— — ‘ 81.40 
: ( i) (zs ae ( ) 


whence, after multiplying by the square of the quantity that appears multi- 
plied by r2 in (81.26) we find 


1 ( 3m* du 
tex 


2 2 
Jos 4 X (3m, + m*)v? = Hf (=) i: wt) (81.41) 
cr dp 
We can now insert on the left the value of v? given by the energy integral 
(81.22) with (r - r)? expressed as 
(r -f)? = r?-v? — pp? (81.42) 


This insertion reduces (81.22) for Ey to 


y+ 


Mo 


1 YMo 3 3m* Y ym yn*y2 
E,= =v —-— wa (1 ae + Fg (Gm + 2m")? + Fe oes 
(81.43) 


We solve this equation approximately for v? and insert the solution into (81.41). 
Replacing 1/r by u we then get 


du\? 1 3m 
(7) +u!] = 2B, +5 (1 — ag 


do Mo 


1 3m* 
medi +3 (t—2%} 
c? Mo 
3y2m? m* ym* 
SiO {9 dye a ys 
+ moe Ju + or ue (81.44) 
For m* = 0 this equation goes over into (58.36) in the approximation con- 
sidered. To make the discussion of this equation more convenient we replace 
the integration constants HZ, and uw by two new constants a and p, where 


ym 
Ey= 33 w= yinop (81.45) 
Further we put, as in (81.30) : 
ym ym* 
ae =. aoe (81.46) 


In Newtonian approximation the quantities a and p represent the major semi- 
axis and the parameter of an ellipse (we confine ourselves to the case of a 
bounded motion). With the new notation (81.44) appears as 


3) . 1 a — 3a* 2 4a 30% 
(7 ap 4a*p = p ap 2 ap u 


6a  3a* 
= (1 ea a + aku? — (81.47) 
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For a>p the polynomial on the right-hand side of this equation has two 
positive roots u, and wu, that are close to the roots of the equation 


1 4. 2 ; 
| ng = (81.48) 
which are 
l+e l—e 
“= ra uy = (81.49) 
; Pp 
with 
Pp 
1—e® == 
a=, (81.50) 
In addition there is a large root u, for which one easily finds the relation 
6a — a* 
aku, = 1 — F (81.51) 
The differential equation (81.47) can therefore be written as 
du\ 2 ee 
(7 = (u, — uu — u)(1 ee - apes ou) (81.52) 
We make the substitution 
Uy tu, | Uy — Ue 
tS 5 + 5 008 p (81.53) 
Then the equation for ¢) is 
3) 6a = a*e 
—] =1—— — — cos 81.54 
(Z = cosy (81.58) 


Here we have introduced the approximate values for the roots given by (81.49) 
into the correction term. Hence 


dp = 
i 
If r vanes from its greatest to its least value and back, ) changes by 2x. In 


that time the angle » increases by somewhat more than 27, say by 2x + Ag 
where 


3a ake 
1 ae eis 
+ - + Op cos (81.55) 


Ag = oe (81.56) 
Pp 

This equation has the same form as the equation (58.43) which gives the dis- 
placement of the perihelion in one period of revolution, but the constants 
involved have a somewhat different meaning. For a given parameter p the 
displacement depends only on the sum of the masses of the two components 
of the system (e.g. a double star). Of the constants of integration only the 
angular momentum is involved in this expression for the displacement. 

The presence of the cubic term in the differential equations (81.47) has the 
effect that the orbit of relative motion is not a precessing ellipse, but a preces- 
sing curve of a more complicated form which, however, does not differ greatly 
from an ellipse [19]. 


CHAPTER VII 


APPROXIMATE SOLUTIONS, CONSERVATION LAWS 
AND SOME QUESTIONS OF PRINCIPLE 


82. The Gravitational Potentials for Non-Rotating Bodies (Spatial 
Components) 

In the preceding chapter we determined the gravitational potentials only 
with the accuracy required to derive the equations of motion of a system of 
bodies. We shall now concern ourselves with a more precise evaluation of the 
gravitational potentials, but in view of the complexity of the problem we shall 
only deal with the case of non-rotating spherically symmetric masses. 

In Section 67 we stated the equations 


1 4U 4S 
00 _ _ xen ES. < 
gM a— 4+ +5 (82.01) 
4U, 48 
gt = = ab — (82.02) 
4S 
and gt = Sy, + = (82.03) 


in which U is the Newtonian potential and U; the gravitational vector potential. 
The terms involving S, S; and Sy are corrections. We did not evaluate these in 
detail because it was sufficient for the derivation of the equations of motion to 
know, in addition to the vector potential U;, the single quantity 


1 
U*¥=U+—(S + Six — 20% (82.04) 
c 
(see equation (67.11)) which, by (68.30), satisfies the equation 
AU* — — —_ = — dy (o2 00 4. THR) (82.05) 


Now we shall also evaluate the correction terms, taking as our starting point 


the approximate form of Einstein’s equations obtained in Section 68. 
1 1 ag 8 
= UV 6 + g v 
5,40" se tH = E Tu (82.06) 


We consider first the equation for the g**. In it we retain terms of fourth order 
in 1/c, but discard all terms of higher orders. 
According to (66.07) we can then, put 


— gT* = pv, — pix (82.07) 
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and in the expression (68.16) for N‘* we can take the main terms 


2 
eee A (82.08) 


Here we have put 
Qiz = 4 §u(grad U)? — — —_— (82.09) 


as in (71.16). We can neglect the second derivative with respect to time in the 
equation for g**. Introducing the value (82.03) for gt*, multiplying by 4ct and 
transferring the term Qjx to the right-hand side we obtain the following equation 
for Se: 


ASte = Qiz — 4rry(purve — pix) (82.10) 
We note that the gravitational vector potential U; satisfies the equation 
AU, = Arypuy (82.11) 


and that, since the Newtonian potential U satisfies Poisson’s equation we 
have 


80% 4 aU 
eas (82.12) 
Therefore equations (82.10) and (82.11) lead to 
ou; =) (02) O(pv¢vx) aU ap 
DU ee ae eee 
( a | Ot Oca age 8 Oe, pa Gel) 


and by virtue of the equations of motion for a continuous medium written 
in the form (66.13) we get 


ou; =) 
A{— + —] = 
( oe (82.14) 
Since this last relation holds throughout space we must have 
OU, | Se 
ee (82.15) 


i.e. the harmonic conditions are satisfied. When we obtain the explicit expres- 
sions for the Sq we shall be able to verify by direct substitution that (82.15) 
is valid. 

We shall seek to find the functions Sq, in the form 


Sec = Vex + Vie (82.16) 
where Uy and Vix satisfy the equations 
AU te = — 4ry(ovtve — pre) (82.17) 
and 
AVex = Qe (82.18) 


Inserting the expression 
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for the Newtonian potential into the equaticn (82.09) for Qi, we can write 


On Zi > Qn + > Qiao) (82.20) 
a a#b 
wheret 
0Ug Oe 
(aa) — 484,(grad ua)? — ee (82.21) 
and 


OU Cun Cua Oud 


b) — 1 . —1)/— — 4+ — 82.22 
Qe 48:(grad wa: grad wp) (7 Crp, ONE =) ( 


In accordance with (82.20) we assume that the solution Vix of (82.18) is of the 
form 


Van = > VE” 2 Var) (82.23) 
a 
with the separate terms satisfying the equations 
AVG®) = Qa (82.24) 
and AV) = Q@) (82.25) 


We shall find explicit solutions of these last equations for the case of non- 
rotating, spherically symmetric masses. In this case the Newtonian potential 
of mass (a) in the space outside that mass is 


yMa 


eae] 


(82.26) 


and the expressions (82.21) and (82.22) for the Q@” and the Q{¢” take on the 
form 


aa Stk (xi — a1)(tz — ax) 
p —yMih [r —al® (82.27) 
ee (x — a3)(xy — by) 
WW = AM My ee Stk 
_ (% = a)(@e — br) + (ar — Bi) (Te — ax) 
alr Sep (82.28) 


These expressions are valid outside the masses. Within the masses one must, 
strictly speaking, use the more accurate expressions (82.21) and (82.22). 
However, the difference between the exact and the approximate values of 
the Q@) will not appreciably influence the values of the Vix, at least in the 
case that the linear dimensions of the masses are small in comparison with their 
mutual distances. Therefore we shall take the quantity Q{¢”) appearing on the 
right-hand side of (82.25) to be given by the expression (82.28) throughout 
space. At the points r = a and r = b this expression has singularities that are 


+ In Section 71 the quantity now called Q{2” was denoted by 9{). 
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no higher than of dipole character. Consequently, the equation (82.25) for 
V& will have a solution which remains finite throughout space, including the 
points r = a and r = b and which tends to zero at infinity. 

This solution can be given in closed form. To do this we express the quantity 
(82.28) in terms of derivatives of the function 1/|r — a| |r — b| with respect 
to the parameters a; and b;. We have 


M,M, ( 02 o 2 1 


2 * 7 a r—a| |r —b| 


(ab) —. .,2 
Ot Y da; ab; Caz bx day, Gb, 
(82.29) 


Therefore (82.25) may be reduced to the simpler equation 
1 


ao |r —al| |r —b| 


(82.30) 


Indced, if ¢ is a solution of (82.30) then the quantity 


Vege) — iat b (31 etn 28 aN, Oe) ) (82.31) 
0a; ab; 0a; obx Oar ob; 
is a solution of (82.25). 
Now, it is simple to write down the required solution of (82.30). We denote 
by s the perimeter of the triangle whose vertices are at the points a, b and r: 


s = |r —a| + |r —b| + |a —b| (82.32) 
Then the function 
p = log s = log (|r —a| + |r —b| + Ja —b]) (82.33) 
satisfies (82.30), for we have 
1 
Al = ———____—_— : 
og s jr —al jr —b] (82.34) 
Therefore the required solution of (82.25) has the form 
y7aM ( logs alogs da log * 
(ab) — Ce Oe ee eh ae g ee 82.35 
Vae 3 "3a; bj 0a, Ody ay Oy eo) 


It is easy to verify that this expression remains finite everywhere and tends to 


zero at infinity. 
We now turn to the equation (82.24) for V4. Shifting the origin of co- 
ordinates to the point a and using the spherical symmetry of our problem we 


can write the equation as 


AVI? = (35. = et) (82.36) 
r 


2 


For brevity we have put 
a = u(r) (82.37) 


We try to find a solution of (82.36) of the form 
V2 — — Byego(r) + (xen — 48eur*)Q2(7) (82.38) 
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Then q, and g, must satisfy the equations 


ote : fe = = u'(r)? (82.39) 
d* 6 Log 


Hence, using the boundary conditions, we obtain 


@ 5 
1 
qo(7) = ; | ru'(r)? dr + re | r2u'(r)? dr (82.41) 
r 0 
co r 
d ee "(r)? dr + : r4u'(r)? dr (82.42) 
on a= 5 {> za | ; 
r 0 
If the radius of the body is Z we have forr > L: 
sje es (82.43) 
and the expressions for g(r) and qo(r) reduce to the following : 
~My 
gt) = G+ (82.44) 
yu? 22 
and g(r) = vi : (82.45) 
=e 
Here/ Ma = La { r4u'(r)? dr (82,46) 
0 
is a length of the order of Z and the quantity 
Eq = = | (grad uq)*(dz)3 = = aus dr (82.47) 
8rcy 2y , 
0 


is the negative of the gravitational energy of the body. If we insert the expres- 
sions obtained for q,(r) and q,(r) into (82.38) we get 


PMN 


V(aa) — 
tk 
5s 


°M x ax 
Y ki } ouk c = # (x, Ck — 4 Suxr ‘) (82.48) 


or if we return to our original variables by replacing 2 by 24 — ay: 


: : M(x, — a,)(a 
Vg? = ee 18a 


=a 
°M?), 
a [Fale | (ee — a)(rz = ax) = 4 dux(r = a)| (82.49) 
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Here the last term is small compared to the first as long as the distance from 
body (a) is great compared to its radius. Strictly speaking that term should 
be discarded because the same Syprosmaton has already been used in cal- 
culating V{2>). : 

It remains to write down the solution of (82.17). For non-rotating masses 
we can put 


mw=d; Pir = — prix (82.50) 
with p determined from the equations 
dp = p dug (82.51) 
whereby, according to (74.24) 
| pldz) = fee (82.52) 
(a) 
We then have : 
1M atid YEa 
Ux = 434 —— : 
ti ial + 2D B80 ay (82.53) 


Here the dipole terms are zero because of the spherical symmetry of the bodies 
and terms of higher order in Z/R (i.e. of the same order as the last term in 
(82.49)) will be discarded. 


We now put 
yM 4,4, , y°Mi(z, — a,)(a, — ,) 
ee) ieeal aie Sal Gees) 
*MaM» ( logs da logs -) 
(ab) _ yar) _ Y “tai? Oe eS ee 82.55 
ane St Va 2 Oa; 2b; Gay Oe = Ay, OD Cr 


Denoting by Si, the sum 
= > Sao + > Sir (82.56) 
a#b 


and inserting this expression into the equation 
4 
gi = — cbr + Stk (82.03) 


we obtain explicit expressions for the spatial components of the metric tensor. 
These expressions are also valid in the region of the system itself, outside the 


actual masses. 
For the mixed components of the metric tensor we had the approximate 


expression 
oo = — 0, (82.57) 


where U, is a solution of (82.11) which, in the approximation considered, is 
equal to 


Up > re (82.58) 


r—a| 


(see equation (76.23)). 
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We can now verify that the explicit expressions found for Sq and for OF 
satisfy the harmonic condition (82. 15). We have 


OU i. asia) 7M att 
a 82.59 
“at “Oak > |r — |r —al ( ) 
Also, using the relation 
Blogs _ (ral + |r —b] — Ja —b]) (82.60) 
Gay Gb; 2|r —al |r —b| |a —b| 


and the two further relations that,may be obtained from (82.60) by permutation 
of the letters a, b and r, we can verify that 
aia) 
OL 


b ( 1 1 
= HMM =e ay ro) ee 


Summing this expression over a and b we get 


asia) ° (az = ba) 1 
= > MM, ——_, ——_ (82.62) 
ab Ont 2, |a —b|? |r —a| 
yM aM b 
Hence, putting ® = —t} >) _———_| (82.63) 
2 |a —b| 
oS 
we have es ea r = sa (82.64) 


Combination of (82.59) and (82.64) finally gives 


ou; OSik Y bs o® 
Stas s 2 recy (nea a *) (82.65) 
Now, ® is the Newtonian potential energy of the bodies. Therefore 


a@ 
Mat + = =0 (82.66) 


by virtue of Newton’s equations of motion. Hence the entire expression 
(82.65) will vanish, in agreement with (82.15). By reversing the argument 
(as was done in our paper of 1939[34]) we could have derived Newton’s equations 
of motion (82.66) from (82.15) and (82.65). 


83. The Gravitational Potentials for Non-Rotating Bodies (Mixed and 
Temporal Components) 

We now derive expressions for the mixed components q™ of the gravitational 
potential to the next approximation beyond that represented by equations 
(82.57) and (82.58). We must put p =0 and v =7 in (82.06) and insert into 
that equation the value of N% given by (68.15) and the value of T° given by 
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(66.07), but with the simplification that the stress py, reduces to an isotropic 
pressure p and that the velocities v; = a; within each body are constant. We 
write equation (68.15) for N™ in the form 


2 
Narra Gs (83.01) 
6U 0U eu, é8U;\ aU 
where Se eel ay | peace eae eng Vea 
@ at day (= =) Om; en 
Equation (82.06) then appears as 
1 1 age 2 Brey 
De Ag ~~ 908 OE => a8 Q: = rae (c? + 4U)T% (83.03) 
where, according to (66.07), we have for isotropic pressure : 
(2 +4U)T = ofp + £ dot +11 +80) + zl (83.04) 
If we insert into (83.03) the expression 
4U, 48 
of = = (83.05) 
for gq we can satisfy that equation if we demand that 
1 aU 
AU; — = = = — 4ry(c? + 4U)T% (83.06) 
1 028; 
and AS; 62 Bp = 0: (83.07) 


Since these expressions involve the speed of light as a parameter the quantities 
U; and S; will no longer be coefficients in the expansion of g™ in inverse powers 
of c. However, this inconsistency in their definition is unimportant because 
equation (83.06) for U; coincides to first approximation with (82.11). 

To write down the solution of (83.06) for the region outside the masses we 
must know the value of the volume integral of the right-hand side of that 
equation taken over the volume of each mass. To evaluate this integral we 
recall the relation 

oll — pua +p =0 (83.08) 
to which equation (73.26) reduces in the absence of rotation. According to 
(74.07) and (74.24) we have 


{ oug(dx)® = 2eq; | p(dz)? = hea (83.09) 
@) , (a) 
Hence (83.08) gives the relation 


| oll(dic)® = Seq (83.10) 
(a) 
Using these results we obtain 


ce 


@ (83.11) 


. ] 
| (c2 + 4U)T% (dz)? = Mair “| = [dre -+ sU(a)| -f- . Egil; 
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where U‘@ is the external potential introduced in Section 71. Here the evaluation 
is much-simpler than in Section 76 because we do not consider atin a2 

We note that the quantity (83.10) is the internal (elastic) energy of the : y. 
Since its gravitational energy is the negative of eq the sum of the two quanti a 
(2/3)eq, is the energy that must be taken in calculating the effective mass. Ihe 
latter will bet 


2¢€ 
mg = Ma +5 A (83.12) 
in agreement with (76.18) and (80.18). 
Using (83.11) we can write the approximate solution of (83.06) in the form 


j 8< 
yu Ma ( ao 
Ue Deal [Ma + Tiedeg +30) + S| 
° 1 2 
cee i 22 83.13 
+ yes ap D, Model a| ( ) 


Here the last term represents a retardation correction. B As'a, avd 
We now pass to the equation (83.07) for S;. Here the second derivative with 
respect to time way be neglected so that the equation takes on the form 


AS: = Q: (83.14) 
As in (82.20) the quantity Q; can be decomposed in the form 
Q, = > QP + > Qe (83.15) 
a a#b 


where Q@® is a quadratic function of the first derivatives of the potential 
of mass (a) and Q{2°) is a bilinear function of first derivatives of the potentials 
of two masses (a) and (b). Corresponding to this decomposition the solution of 
(83.14) can be written in the form 


S, = > Se + > Sev (83.16) 
a a#b 
with the separate terms satisfying 
AS@®) = Qa) (83.17) 
and AS{2>) = Qe) (83.18) 


For Qj*” we take an expression analogous to (82.21) which is valid even within 
the mass in question : 
Oug Gug 


aa) — gi _ 
% i| Ox Ox; 


For Q{?) we confine ourselves to an expression that is correct outside the 
masses and is analogous to (82.28), namely 


a 1.2 aus (xj — a;)(x_ — b ) 
OP = HPIEM 8, — aby 
ae (36, _ 4a) (x4 — a4) (x4 = by) a A(dg as b,) (x; = a3)(x4 = b;) 


(83.20) 


} It is important to bear in mind the relation (83.12) when comparing the present equa- 
tions with those of our 1939 paper [34]. 


+ 481(grad ua) (83.19) 
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To solve (83.17) we express its right-hand side in the form 


. Qe) = a, Qa) aE 74, Qe” (83.21) 
where the Q{% are the same as in Section 82. It evidently has the solution 
- SEO = 4, VE" + 14, V90) (83.22) 


where V{2) is the previously determined solution (82.38) or (82.49) of equation 
(82.24). 

Passing on to equation (83.18) we represent its right-hand side, i.e. the 
expression (83.20), in the form 


Qfo — ; MM, (4, — 4b,) a + (36, — 44,) or + A(Gy + :) =a 
x Poa roa (83.23) 
It follows from (82.34) that (83.18) is solved by 
Sad) — : MM, (4, — 4b,) i 
+ (3b, ~ 44,) = = +4(4, +6,) - “3° (83.24) 


We shall also write down the value of S{**) outside the masses. To do this we 
must insert the value (82.49) of V{® into (83.22). The terms containing Ag 
rapidly tend to zero at large distances and can be omitted. In the expression 
so obtained for S{@” it is convenient to separate off a harmonic term, writing 


22 YE Raa) 
Seo = 3 4 [r ey + 8%? (83.25) 
. (a — a4) (aj — ay) TT 
where Sea) = 7M? ay —4[r—al* = a] r I F—al? = a] : (83.26) 


Insertion of these expressions into (83.16) gives the value of S; which can be 
combined with the value (83.13) of U; to determine g™ from (83.05). The 
equation for g% may be written ih the form 


eee ee (83.27) 
3 c$ 
a. %, Ma 
where U, = 2. merits + oe + as 302 + 3U@ )] 
1 
toam 2 Moti |r —a| (83.28) 
and 5, = > Seo + > Siar (83.29) 
a axzb 


Here the quantities U, and S; differ respectively from U;, and S; by harmonic 
terms containing ez. We note that eq enters U; only through the effective 
mass (83.12). The sum S; is a homogeneous quadratic function of the masses, 
whereas U, depends on them linearly (apart from terms containing U“(a)). 
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It remains for us to find an explicit expression for the temporal component 
of the gravitational potential, i.e. for the quantity g°°. In Section 68 we have 
already written down the equations determining g®. By (68.43) and (68.41) 
we have 


Pee) ee 0? (83.30) 
c 68 c8 
where U is a generalized Newtonian potential satisfying the equation 
2 ‘ ; 
AU — = anv = —4ry (c? + 4U) T™ (83.31) 
c® at? 


This equation has on its right-hand side the expression 
(c2 + 2U)T =p + 5 (Ju? + II — 30) (83.32) 


which follows from (66.07) and was used in Section 79. The potential U can 
be evaluated in a manner similar to the procedure followed for U in Section 
77, with the simplification that now the masses are taken to be spherically 
symmetric and non-rotating. Using (83.09) and (83.10) we get 

2 M 

te ges a — U@ (a)} (83.33) 


3c? % 


[ect + 4U)T (dx)? = M, + 
(a) 
and therefore 


= Y 2 M, Wa 
0= Dt et get Mt U' (a)} 


1 2 
+ agp BaD. Male — a (83.34) 


Insertion of this expression into (83.30) gives g®. This concludes the deter- 
mination of the gravitational potentials in second approximation. 
We must still verify that the expressions obtained for g and g” satisfy the 
harmonic condition 
ag ag 
—~—j— =0 3.35 
ot ox, Pe) 


Differentiation of the expressions (83.34) and (83.28) for U and U; gives directly 
a 00, 7 2 yMadiU' (a) 1 YM ( . 1d ®) 
es 


“Ot om 20 Bay, — |r — al es |r —al 2° dt 
(83.36) 
Further we have by (83.26) 
asa) 70 M2 
du, —(ité‘i kL |r —al? ret) 
In calculating the expression 
aS@) 
to ae 
Ox; 2 i MaM 
: a dlogs 0 dlogs é d*lo 
De Bap ab a ah os aay ES i see ee 
Ca; ab; Ox; ( 4 “) ab; 0a; Ox; Aa; mu 64) OX; 0a; Ob; 


(83.38) 
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we must use equation- (82. 60) and two further equations obtained from it by 
permutation of a, 6 and z ;-in addition we must bear in mind when transforming 


(83.38) that log s and its derivatives depend only on coordinate differences, 
so that, for instance 


(+24 @\1 ar, 
2a; ats) gs= (83.39) 


We can combine all terms in (83.38) which contain |r — al |r — bj, then 
terms containing |r — a] |a — bj and finally terms containing |r — bj Ja — dl, 
getting 


AS(ev) 1 ‘ F) a i] 
=-y*M — Ta; — — er 
Oxy res oit,| 7% da, 16, ETF —al |r —b| 
1 ] 
-+7°*M Td ieee Ree ee ee 
rat ool ~ 4 6 haley la — | 
] 


1 
+ i 7°M eM {- 1b, az (dy + by) =| (83.40) 


[FB fa— 87 
Summing this expression over a and b (omitting the term a = b) and adding 
to (83.37) summed over a, we i 
as, 7 1 DS ets a yMadiU (a) 
i bn S 


ay 2) ae a |r ~ |[r—al 
+See a) ee 


Now if in equation (83.30) U is replaced by U in the correction term the left- 
hand side of the harmonic condition (83.35) differs only by the factor 4/c* from 


the expression 


au 8s: 7. a ay, ahs cams 2) 

a teeta gee a) 2 Poa is=al ali a 
(83.42) 
which is obtained by adding to (83.36) the expression (83.41) divided by c? 


and transferring one term to the left-hand side. 
But we have 


ou ®) eo 
M = ——_ 83.43 
( Om sa 0a; ( ) 
where ® is the Newtonian potential energy 
= 5 et MMe (83.44) 
2 iF |a ~~ Ja —b] 
Therefore, by virtue of Newton’s equations of motion, we have 
(a) 
Go ( ae ©) (83.45) 
Or; a 


and the right-hand side of (83.42) vanishes. Thus the harmonic condition 
(83.35) is satisfied. 


330 The Theory of Space Time and Gravitation 


In order to obtain for the gravitational potentials not too complicated 
explicit expressions which remain valid in the region of the masses (though 
outside the actual masses) we were obliged to limit the discussion of this and 
the preceding section rather drastically. We had to assume that the masses 
were spherically symmetric and that they did not rotate. In the following 
sections, in a discussion of the gravitational potentials at large distances from a 
system of bodies, we shall free ourselves from these limitations. 


84. Gravitational Potentials at Large Distances from a System of 
Bodies (Spatial Components) 

In this section and the next we shall derive explicit expressions for the 
gravitational potentials which are valid at moderately large distances from a 
system of bodies. We call distances ‘“‘ moderately large” if they are large 
compared to the dimensions of the system yet small in comparison with the 
lengths of the waves radiated by the system (see Section 64). Our assumptions 
concerning the internal structure of the system will be as general as those used 
in Section 79 in deriving the integrals of the motion for a system of bodies. 

We begin with the determination of the spatial components g**. Asin Section 
82 we have 


4 
ar = — Bee +S Ste (84.01) 
where, as in (82.16) to (82.18), 
See = Oi + Viz (84.02) 
and the functions Ug and V4 satisfy the equations 
AUeE => Atcy (ove — Ptk) (84.03) 
and 
oU aU 
AVex = 484% (grad UV)? — — — (84.04) 
0X4 OLE 


The right-hand side of (84.03) differs from zero only within the masses and, 
using mathematical terminology, it has moments of all orders (see Section 
70). This means that if it is multiplied by a product of coordinates of any 
order and then integrated over all space, the integral is convergent. It is, there- 
fore, possible to write a solution of (84.03) valid outside the system of masses 
as a series of spherical harmonics, i.e. as a multipole expansion. We shall take 
as the origin of coordinates some point within the system of masses, not 
necessarily their mass centre. We obtain 


Oy, =U, + UM +... (84.05) 


where the first two terms are 


Y : 
Ut =" [ (ooim, — pulley (84.06 
and 


Yas 
UR = or { (9%), — Py,)t,(da)$ (84.07) 


Approximate Solutions 331 


On the other hand the right-hand side of (84.04) is different from zero 
throughout space and onl; decreases as the inverse fourth power of distance. 
Therefore only a zero order moment exists and the solution of the equation 
cannot be represented as a series similar to (84.05). 

To find a solution of (84.04) we introduce into its right-hand side the integral 
representation of the Newtonian potential 


, dx’) 
7%) 2k 
¥ | = (84.08) 
We then obtain a relation which may be written as 


1 
AVan= 57" | | 0” (axe (dey 
2 


7) 2 oO? 1 
[Bu a a Bae ~ Beet ooo oo SC—~—é« 84-108) 


Ox, Ox; = Ox, a, = Oa, On! |r —r’| |r —r’| 
We know from Section 82 (equations (82.33) and (82.34)) that the function 


log s = log(|r —r’ 


pleat’ | bile ee |) (84.10). 


satisfies the equation 
1 


|r —r’| |r —r"| ae. 


A logs = 
Hence it is simple to deduce that the solution of (84.09) that is everywhere 
fimte and that tends to zero at infinity, may be represented in the form 


lis logs dlogs log | 
Vass f \39” “ya pe, ef oS 84.12 
hor {f eee ae) [Bu dx, Ox; 0x, Oa, «=O, Ox; ( ) 


We are interested in the value of this expression at large distances. To evaluate 
it we must find an expansion of s and of log s in inverse powers of r which is 
valid for large r and finite 7’ and r”. We have 


: - LL, 2,2; 2 anne ue) Oe Bk, 
(84.13) 
whence 
1 ; , a(x, + 27) 
log s = log 2r +5. (t —r"| — : 
“h(a! +a7)|ro —r"| + a (72 + 7”? 4. Iara") 
Se 473 (2; ,] 8r2 4 
— Hite (Zan + Baitet + ate + alan) +... (84.14) 


‘Srt 
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By differentiation we obtain the symmetrized and antisymmetrized second 
derivatives with respect to x, and 2;. 
1 (oe a? log ) _ i (3. si 
2 \eu, dx, Ox; On, 4r? 
< ( wm, (ay + *))( Sn (% — MNO, 7 *»)) (84.15) 
2r 4r3 |r’ —r”| |r’ —r”|? 
and 


ak ee Te Te Se 484,16) 


“ 


2\ dn; ax 7 Ox; Ox, 

Here we have discarded terms that decrease more rapidly than 1/r?. We put 
4 =l and k =1 in (84.15) and sum over IJ. This gives 

logs 1 | (: - a, (a4 + “») 1 (84.17) 


r 273 r’—r 


(ses é? log ) tL, — 2, CBee 


Gar, Oxy = Dr? 


in agreement with the exact relation 
logs _ 1 _ ( 1 1 ) 1 
on, dx, = -2|r —1'| |r —r"| 2 |r —r'| |r —r’| |r’ —r”| 
(84.18) 


We insert (84.15) and (84.17) into (84.12) and consider first the terms that 
decrease as 1/r. Denoting the corresponding terms in Vx, by V9, we obtain 
‘e , , ” ” (x, _ @,)(2, J Ly) 
. = dxr’\3p” (da”)3 —S$_*" kk" 84.19 
Vie rT fe (dx')%e (dz") jr’ —r’|8 ( ) 
Only the part of the integrand symmetric in v’ and z” is important. We there- 


fore replace the factor 2; — xj by 2x; and perform the integration over 2”. 
This gives 


yf oe 

i= fa e (dx) (84.20) 

or, since this expression is known to be symmetric in i and 

Y oU a 
ta — — a 2 
tk XI (a ary are Ly, On; Pp (dz) (84. 1) 
(see also the lemma in Section 79). We note that this expression is also equal to 
1 1 au aU 
pos f fe 2 _ Nd) 

ti re | ( 5 5,,(grad U) = =| (dz) (84,22) 


as was to be expected because Viz satisfies (84.04) and the volume integral of 
the right-hand side of that equation is finite. 

We now examine the terms in (84.15) and (84.17) which decrease as 1/r?. 
Among them are some that are independent of x’ and x’. These contribute to 
Viz the quantity 

j 2M aya 
Vi= = (84.23) 


where M is the total mass of the system. 
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The remaining terms of order 1/r? are of dipole character. They are 


2a fae ” tt PPS oe 
Vo = — ues e’ (dz')®o” (da”) ty ws a; (t, — )(a, — 2) 


9y3 | rr’ —_ r”| 3 (84.24) 


But we have the identity 


f 


[zxj(a, — 24) + 2, 0;(a, — 2]) — L,a,(2, — 2;)] 
+ [xjx; (a, — 2) + ayn; (ap — a) — apay(x; — 2,)) 
= (a, + v5)\(a, — ja, — a) (84.25) 
in which the two expressions in square brackets on the left are related by the 
interchange of x’ and 2”. When performing the integration in (84.24) we can 


therefore replace the factor (84.25) by twice one of the square brackets. We 


then perform the integration over the variable that enters the square bracket, 
linearly, so obtaining 


yx aU aU --) 
vo — 9 ( eae —— = fp — 7 84.26 
te = p73 | (WM GE + Bite DE — wie a,}° (dz) (84.26) 


We now combine those terms in Uy, and Vy, that have the same multipole 
character. Putting 


So, = US, + V3, (84.27) 
and adding (84.06) and (84.21) we get 


= we 9 ou --)| 3 84.28 
Sie = © | {een 2P iy 1 of Oni + Xs Ba (dz) (84.28) 


Writing the term involving pix in the form 


Opyk Ty 

_9 3 as —— (dx? 84.29 
[ ratdey = | (oe BE + oe PZ) (an (84.29) 

and using the equations of motion for the internal problem 

ou Opyt 
Bi ee 84.30 
Bae rae Te ( ) 
we obtain from (84.28) 
So. = tl (rw, + ©, + 2u,v,)(dz)? (84.31) 
d 

or S= x aa on, x,,(da)® (84.32) 


Thus, the quantity S%, has been finally transformed to a form involving the 
second time derivative of the corresponding moment of inertia. 
We now transform the quantity 


se — UY + VY (84.33) 
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From (84.07) and (84.26) we get 


oU oU (4) 
Y7j 3 
SQ) = 573 14 | lovin, — 27,2 ee of hy ae + Ly —— oo — Zl, Ai ==) (aa) 
. (84.34) 
Using the identity 
o ODks Opis Opys 
as (xjxepus + LjLEPta — LeTnPys) = 2XyPix + TjTH Ee + ZL ae LiLk 2a, 
(84.35) 


we represent the integral involving py as 


8 y op; 
—2 | pac(aa? = | (=e ee + ayoe om — aoe 2) (dz)? (84.36) 


With the aid of the equations of motion (84.30) we can then write 


St. es ee | e(2r,0,0, ae LiL iWy + a =< 2,0,w,)(dx)? (84.37) 


273 
d 
or Sy = = dt { P(x,2,V, + 2,2,V, — @,,,0,)(dz)° (84.38) 


Hence, the equation 
Sey = SY a SR a Vix (84.39) 


gives us Sq, and if we insert this value of Siz, into (84.01) we obtain the following 
final expression for g** : 


1 a? 
gt* = — ik +— or | 0x 4rx(dx)8 


2ya; d 2M are 
5 Cr dt O(2LyVe + LzTyVs — “iE xv;)(dax)? + ar ae 


(84.40) 
We have obtained explicit expressions for the spatial components of the 
gravitational potential which are valid at large distances (in the sense defined 
above), from the system of bodies. Among the terms containing c* in the 
denominator those have been omitted which decrease more rapidly than 1/r?. 

We note that for a single concentrated mass the expression (84.40) reduces 


to that obtained by rigorous solution of the gravitational equations (equation 
(58.12)). 


85. Gravitational Potentials at Large Distances from a System of 
Bodies (Mixed and Temporal Components) 


To determine the mixed components at large distances we return to the 
equations given at the ere of Section 83. We have 


2c ae 303 a onG W% — aa + 4U)T% (85.01) 

aU aU aU au;\ aU 

where A gees, alah ( t.. alae 
i at oa, 0a, a4) Oa, (85.02) 
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and, according to (66.07) 


o 1 
(c? + 4U)T% = eu(1 + a Get ++ 30)) = 2 pits (85.03) 
c 
As before, we put 
get 4 = (85.04) 


and subject U; and S; to the equations 
1 aU; 


AU, — 5S = — Amc? + 4U)re (85.05) 
1 22g; 

and AS; — roe a 0: (85.06) 

As in (75.18) we introduce the quantity 

. 1 
Wasy | o’|r —r'|(dx’)? (85.07) 
and also 
1 

Wi= 57 | (pv,)’|r —r’| (de’ (85.08) 


We can then write 


e+4Urey 1 ew 
Ur:=y | ea Y+s ae (85.09) 


Here the last term is a correction for retardation. We now perform the expan- 


sion of the integral into multipoles and confine ourselves to the first two terms 
and an approximate value of the third term. This gives 


U;= t i (c? + 4U)T%(dx)® + se (c? + 4U)T%z(de)? 


oO? 1 aw 
us | evi xx(dx)® + — : 


ae 85.10 
Ox; Oxy Zr c® at? ( ) 


In this expression terms containing the momentum and the angular momen- 
tum of the system can be separated off. According to (79.19) and (79.34) 
those constants of the motion can be expressed in terms of the function G; 
which, according to (79.18), is related to the quantity in (85.03) by 


= ( —) 85.11 
(2 + 40)T% = Gs +5 40; + ek (85.11) 
We then have 
P= J G(dz)? (85.12) 
and Mu= | (Gy — rxGo(dx)? (85.13) 


We shall perform the transformation indicated after we have evaluated S; 
because a number of terms in this quantity can be combined with terms of Uj. 
In the equation (85.06) for S; we again neglect the second time derivative and 


write 
AS; = Q (85.14) 
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Just as was done in (84.09), Q; can be written as a twofold volume integral 


. : 1 ( ak 4 e2 
oy 2 , uw tow sos 4 
= | feaen iae PieeE) ° | 2 Ox, Oxy, Ox, oz; 
Q2 7/2 Q2 1 . 
a 4S:x 7 "7 iA 7 7 7 ) 7 7 (85.15) 
Ox Ox = 2\ On, Ox, =O, Ox, |r —r [|r —r | 


whence we immediately obtain the solution of (85.14) as 


dn’\*(d") are | spre 4. ? log a) 
s= | fle) ete YOM) a) Ox, Ox, On, Ox; 
2 2 2 
$B es Te ee} (65.16) 
Ox, Om, 2\0x, OE, OX, Ox, 
Here we must insert the expressions (84.15) and (84.16) and integrate over 
x’ and x”. 
We first split off the term that decreases as 1/r. It can be written as 


0 Yf U 3 | ew | 85.17 
S? A 4] pU,(dx)? + | ov, in Oe (dz) (85.17) 
On the other hand, if the obvious relation 
ow 
pace ee 85.18 
fe a 0 (85.18) 
is differentiated with respect to time, it gives 
t (aw aw 
peers aaa 3 85.1 
} : (= Be oe sa) ee : one) 
and this enables us to write 
a2w 
sy = 14] ou yaa) —{ ae | 2 
p= H-4] et (dey — | p —— (dz) (85.20) 


We now consider the terms in (85.16) which decrease as 1/r?. Among these 
there are some in which the integrand does not depend on x’ or x”. These give 


S, - wees ie Uae 
r 4r 
The remaining terms of order 1/r? are of dipole character. Denoting them by 
Si), we have 


r= ee BH ena 
eat [ (ovr) )'p w\ zi z ACA i ©;)(x;, — 2) (da’)3(dx")3 


r’ — r”| 3 


eat {c (pv;)'p” “Te no (dx’)3(dz")3 


aoe = £30 { (ovx)’p ree (da’)3(dar")? (85.22) 


(85.21) 


We write this expression in the ee 


a1) _ YP 
SY =f (dy + By) (85.23) 
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where A and By are, pespertavely, the antisymmetric and the symmetric parts 
of the coefficient. 


Ay = — Ay; By = By 85.24 
We then have : ce 


Ay = —arf fi (ev)'p” re = lr —-¥] (dz’)(dz")8 


+ — it] | (one Ye aT 


— 77] | ont iF Chet — BEN =D rycen (5.28) 


sr" | 


(dx')*(dar")3 


and 


By = = oe iff (ev)’e )’ 7 ee al (dx’ (da”)3 


1 J (ute ry aarmae'y 
1 ‘eof? a “ow 
ae All (evr)’p” eaep eeneey 


r a 
+- zal | (pvx)’e” re = ra (dx’)?(dx")? (85.26) 
After some manipulation we can te Ay as 
1 aw ald 
, = pee eye 3 
w= — 2) ple — algae — 5) 0 (ae — ao) ay 
(85.27) 
Here we have used the relation 
Q 
CUE OE (85.28) 
at “OLR 


which follows from the definitions of W and W, in (85.07) and (85.08) together 
with the equation of continuity. 
For the symmetric part By of the dipole coefficient we get 


7 1 eu aU 
By = — i o(xyv, + 24v4)U (dx)? + | prety (m2 — Fe) tae 


aw 
+58u { p—-(dz)* (85.29) 


Here we have used the relation (85.28) and the analogous relation for the 
potentials U and Ux. Jt remains to form the sum U; + (1/c?)S; and to combine 
similar terms. Denoting by U? the first term in (85.10) and using (85.20) we 
obtain 


Ul +— * §9 = «f {(c* + 4u)r% —£ £ (au, + has (85.30) 


But, by virtue of (85.11) and (85.12), this can be expressed as 
yh (85.31) 
r 


1 
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where P; ‘; the total momentum of the system, including corrections of the 
order 1/2’. We now consider those parts of the dipole terms which have anti- 
symmetn : coefficients. It follows from (85.27), (85.11) and (85.13) that they 


are equal +9 
1 ,.\ _ ¥% 
al (c2 + 4U)(2,T% — x41 %)(dz)? + a Ai _ oy My (85.32) 
where Mj; is the total angular momentum of the system, including relativistic 


corrections. ; 
The complete set of dipole terms can be written in the form 


1 1a 
Uw 4. 5 So = oe (M,, + Dy) (85.33) 


where Dj; is the symmetric part of the coefficient. The use of a dotted symbol 
for it anticipates the result that this quantity can be represented as a time 
derivative of a certain quantity Dj; which has a simple physical meaning. 
Comparing the last equation with (85.10) and (85.23) we can express Dj as 


Dym | (et + AY May + Peg de)? +7 By (85.34) 


If we insert the value of 7% given by (85.03) and the value of By; given by 
(85.29) we get : 


= 1 
Dy = i p(reay + vye)(dz)> + =| p (40? + IT — 2U)(vexy + vyx)(da) 
1 . 1 eu aU 
= s/ (pexvuty + pjxvers)(dx)® + xa! erry (n= = eae 
7 ow 
=i eo by | ea (da)? (85.35) 
Adding to this the expression 


mi —p——~- — 3 
mo { L4LjVK (ows e oi Pet) a) =0 (85.36) 


which vanishes by virtue of the internal equations of motion, /and using the 
relation 


ow, d 
p> (dz) = 73 | eW (dz)? (85.37) 


as well as equation (79.04) for the total time derivative of the elastic energy II 


we can write Dy in the form 
dD, 
Dy = a (85.38) 


where 
] ' 7 
Dy = eect We (407 + IT — 30)| (dey sae By { eW (day 
(85.39) 


The first term in this expression is the moment of inertia of the system of 


bodies, calculated with inclusion of the masses of the kinetic and potential 
energies. 
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Now we can write down the complete expression for U; + (1/c?)S;. According 
to (85.10), (85.21), (85.31) and (85.33) we have 


1 L. 2; dD. 
Up 6, et pe plat al, 
ON gt get op ga a 
oe Ty2M P; y?M Praicr, 1 o7w. 
Fy t 
Oar Oxy Or | pummsa(de)? + Ao*rs cat? 
_ (85.40 
and therefore 
4y Qyxy oe 2 
Gio “hp, ge ST A ee 
: or Es e373 Mi Oxy Ot c8r s 
o2 2y Ty7MP; 7M Proce 4 o2W, 
ee 3 mee Panay eens 
OX; OL% =| aca ad a cer? o cr4 ce Ot? 
(85.41) 


It remains to find the temporal component 9, which by equation (83.30) 
is expressible in terms of the generalized Newtonian potential 0. By (83.31) 
and (83.32) 


M 2. 1 @ 
Pa eye, ace op 
r + iO oa on. F ae 


1 Be 1 ew 
sashes ee dz) + — 5. 
6 0x4 OLE CL; Tr | payee ) a c? at? © a) 


Here M is the total mass 


1 
M= | oft +5 Get +0 — 10)| (dx)? (85.43) 
which is constant by virtue of (79.45) and the X, are the mass centre coordinates 
1 
ef MX; = f aof1 + = (fv? + TT — 30)| (dx) (85.44) 


which, by (79.59), are linear functions of the time. The quantities Dj; are defined 

by (85.39). The last term in that equation, which is proportional to 8; drops 

out of the expression (85.42). In the octopole moments we have discarded 

relativistic corrections. The last term in (85.42) is a retardation correction. 
Inserting the expression obtained for U into the equation 


1 40 70? 
00!S2 Fate ne 85.45 
3 c =e ce * ce ( 
we obtain g°° as 
1 4yM  4yay oz by a 2y | : 
Oo omen cs eS eed)? 
a P a Or th c3r3 MAG ot 2ay Oxy CFr Dye Oxy Ox; Ox, 3c8r pagryee( ae) 
2M2 2 
Ty?M? | W4y*MPXyxy | 4 OW (85.46) 
cor? c°y4 ox ot? 


It is easy to verify by direct insertion that this expression, together with the 
expression (85.41) for g® satisfies the relation 


ot 
— it os" —0 (85.47) 
ei 
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exactly. In the equation (84.40) for the spatial components, which may be 
written as 


2 2 
gt* = — coy, + op = | ovyrn(da)? 
oz 2 2M x2, 
ea = O(a LVKE + BLK’ — 14x404)(da)® so a (85.48) 
Tj 


terms of order 1/c (more precisely q4/c*) have been retained, but higher terms 
have been dropped. If the quantity g™ is taken to the same accuracy it is easy 
to verify that the relation 
ag" ae 0 (85.49) 
ot OLk 
is also satisfied. 


86. Solution of the Wave Equation in the Wave Zone 

The expressions derived above for the gravitational potentials are valid at 
‘‘ moderately large ” distances from a system of bodies. As was explained at 
the beginning of Section 84, this term is understood to mean distances that are 
large compared to the dimensions of the system, but small compared to the 
length of the waves radiated by the system. If, on the other hand, the distances 
are large compared to these wave lengths one is concerned with the so-called 
““wave-zone’’. In the wave-zone the term in the wave equation involving the 
second time derivative may no longer be treated as a correction and the method 
of solving the equation must be different. 

In the case of the Solar System ‘“‘ moderately large’ distances extend as 
as far as the nearest stars, i.e. into such a region of space in which the system 
may no longer be treated as isolated. In practice, therefore, the study of the 
Solar System does not take one beyond moderately large distances. However, 
in some theoretical questions, such as that of the emission of gravitational 
waves, or that of the uniqueness of the solution of the gravitational equations 
one must consider distances which belong to the wave-zone, and are arbitrarily 
great in a mathematical sense. 

Before going on to the solution of Einstein’s equations for arbitrarily large 
distances we shall clarify the notions of “ moderately large distances ” and of 
the “ wave-zone ” with the aid of the example of a simple wave equation with 
an inhomogeneous term 


— =5 = —4no (86.01) 


The quantity corresponding to ) in gravitational theory will be the difference 
between g" and its limiting value at infinity. — 

If the “ density ” c is taken to be known the solution of the wave equation 
which is of interest to us may be stated as the retarded potential 


p= | oF] (dx’)8 (86.02) 
where [o] = oft’, r’) (86.03) 
with ti =t — 2 |jr—r’ (86.04) 


c 
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The fact that we choose this solution and not any other corresponds to our 
conception that the system is isolated and that the only sources of waves shall 
be the bodies that make up the system. The exact form of initial conditions 
is of no importance ; it is sufficient to assume that the initial disturbance was 
concentrated in a finite region around the system and that we are dealing with 
such times and distances at which the initial disturbance has dispersed. 

We assume that the density o possesses both time derivatives of various 
orders and “‘ moments ” of various orders, for which we introduce the notation 


wolt) = f o(c,r’)(dx')? 
u(t) = J xio(z, r’)(da’)® (86.05) 


ant) = f 2yayo(e, r’\(de') 
where + is a quantity independent of r’. 

The technique of solving the wave equation used in the preceding sections 
amounted to an expansion of the argument ¢’ of o and of o itself in inverse 
powers of c. Since such an expansion leads to the appearance of increasing 
positive powers of |r — r’| in the integrands this method clearly gives a well 
converging series only for “ moderately large” distances r. (The region of 
“ moderately large” distances can, in fact, be more precisely defined as the 
region in which the series converges rapidly.) Each term of the series can in 
turn be expanded in inverse powers of r, and then the coefficients of the expan- 
sion will involve the “ moments ” (86.05) taken at t =U. 

But we can instead split off the quantity 

eee ee (86.06) 
c 


in the expression (86.04) for ¢’ which then appears in the form 
page pare) (86.07) 
c 


If then t’ and o are expanded in powers of 1/c, only in so far as this quantity 1s 
not involved in t, we obtain a series of a different form, and one that converges 


for arbitrarily large values of r. 
If jn this series we take all terms which have the slowest decrease with 


increasing 7, we obtain an expression for which is valid in the wave-zone, ie. 
for very large values of r. The retention of none but the most slowly decreasing 
terms corresponds to the replacement of the quantity (86.07) by 


{=t+— (86.08) 


He; 
nme = (86.09) 


a1 


where n= 


‘3 a unit vector in the direction of r. In this way we obtain for } an expression 

of the form : 
ere (86.10) 
¥ = = H(t, 9) 


where the function 


, 


w(t, n) = | F (« + — r) (dx')? (86.11) 
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depends on three arguments, namely the quantity + and the two angles that 
determine n. We note that if u is an arbitrary function of these three arguments, 
) is an approximate solution of the homogeneous wave equation. 

If the moments (86.05) exist we can expand u as follows: 


n Ll mine . 
w(t) = Wy + — fie + 5 dae +. (86.12) 
or 
4 a ] Vly 
u(t,") = wo + oy ee fe 5 cape Hlth +... (86.13) 


where the dots denote differentiation of the quantities (86.05) with respect to 
their argument t, or, equivalently, with respect to time. 

The region in which the solution of the wave equation for ) can accurately 
be given in the form (86.10) is known as the wave-zone. 

When the dimensions of the system are small compared to the length of 
the waves emitted by it, the expansion (86.12) converges rapidly, and its first 
non-vanishing term will be the main one. If this term is p, the function will 
be spherically symmetric. 


87. The Gravitational Potentials in the Wave Zone 


We now turn to Hinstein’s equations as stated in Section 68. As in (68.13) 
we introduce the quantities 


v ae 9 a v v Vv 
Nov = (—$) 1098 Th, — tyy" + 49°"D) (87.01) 
where L is the Lagrangian 
1 ag%® 
= ——_—_]]., ~ +lyy 
PVH Ge, MY ne 
and the other quantities involved are defined as follows : 
: 1 agus ages agx* 
T+. 28 — ( ap Bp — que 
2g ox, +9 ox, : ox, E98) 
26 = IauIertl™ (87.04) 
= é log v(—9) : v Th 
y, = a ~=PY, (87.05) 
Then Einstein’s equations in harmonic coordinates appear as 
aig 1 argu. 8 
— (Re — v =, is Jae ee v __ ae v 
( 2 )c 4g*R) act 9 ta a, +Nw = — = Te (87.06) 


We must investigate the asymptotic form assumed at large distances by the 
solutions of these equations. To do this we consider first the wave equation 


Vero ite =0 (87.07) 


and replace in it the coefficients g°° by their “static” values; for simplicity we 
shall assume that the origin of coordinates is at the mass centre of the system. 
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We can here use the expressions (85.41), (85.46) and (85.48) in which, how- 
ever, we discard all terms that decrease more rapidly than 1/r and of the static 
terms we retain (for the time being) only those of order 1/r. We introduce the 
gravitational radius of a mass by the relation 


% = (87.08) 


and go over to the spherical polar coordinates which are connected with the 
harmonic coordinates by the usual relations (57.03). Then we can write the 
wave equation (87.07) in the form 

4a\ a OO OOD) op a ) 

a Ce = 0 (87.09) 


1 
OY = —(1 
¥ al = r Or 
Here A*y is the ordinary Laplacian on a sphere (see equation (57.06)). We are 
interested in solutions of the nature of an outgoing wave. For such solutions 
the term A*/r? will decrease at infinity more rapidly than the remaining terms 
in the equation and we can discard it. Then equation (87:09) becomes 


5 (1+ 2) St (F428) =0 (87.10) 


Here only r and ¢ are independent variables, the angles 9 and enter merely 
as parameters. 
We make the substitution 


n=f (87.11) 
and we introduce in place of r the variable 
r¥ = r+ 2a(log r — log 79) (87.12) 


where ry is some constant. Then, apart from small quantities, we get 


eee eer, (87.13) 


A solution of the nature of an outgoing wave will be 


f=f len) (87.14) 
where, as before, n is the unit vector (86.09) and + now has the value 
=t— = a (87.15) 
c 


or 


t=t— r 4. Qe log (2)} (87.16) 


0 


Thus the solution of (87.09) which is of the form of an outgoing wave, has the 
asymptotic form 


Seer (87.17) 


Here + is assumed to be finite while r may be arbitrarily great. 
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Under these conditions the asymptotic values of the derivatives of | with 
respect to the space and time coordinates can be evaluated as if » depended 
on them only through the quantity +. Putting 


= ail (87.18) 
a On, 
we have in the approximation in question 
ey b (87.19) 
Cx a 


a 
Neglecting terms of order «/r in comparison with unity we can take the k. 
to be given by 


sie ree (87.20) 


wo5; Ha+™ (87.21) 
Cc 


whence 
kk* = 0 (87.22) 


The quantities k, are proportional to the components of a four-dimensional 
null vector (in the Galilean space corresponding to the limiting values of the 
a”. 

We now add to the static values of the g"’ in the coefficients of the wave 
equation (87.07) their wave part b"’, putting 


jive eee a poe 
Cc cr 


got = bm (87.23) 
git = — Bi, + bt 


We shall assume of the 6" that in the wave zone they either have the form 
(87.17) or, at any rate, that they satisfy the condition (87.19). Therefore we 
can calculate the derivatives of the b“’ according to the rule 


= kb (87.24) 


Since the derivatives of the static terms in g“” will decrease as 1/r? and may be 
discarded, we also have 


dg’ 


= k bv 
oe k,b (87.25) 

The harmonic conditions for the g"” appear as 
kb = 0 (87.26) 


and they may be integrated with respect to t. Since we have already split 
off the static part of the g"” the constants of integration may be set equal to 
zero and we get 


k,bey = 0 (87.27) 
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Hence the mixed and temporal components among the b"” can be expressed in 
terms of the spatial components as follows : 

NUN 
= ote (87.28) 
The more precise form of 4/(—g)/c times the d’Alembertian which may be 
obtained by inserting the values (87.23) for the g"’ into (87.07) now appears as 


1 1 ay 

- V(— g) Ob = O° + ~ 6 

_V(— 9) OY b+ ears (87.29) 
But if | is of the nature of a diverging wave which depends on the space and 
time coordmates mainly through + and which satisfies (87.19), we have 


Nk 
ho a pS bt ; §00 
Cc 


a OP ay 
b Bn, Be, = k,k,b*) = 0 (87.30) 
as a result of (87.27). Therefore the additional term in the d’Alembertian will 
vanish separately and we can discuss the outgoing wave solution as if the 
coefficients of the d’Alembertian in the wave equation (87.07) and in Einstein’s 
equations (87.06) involved only their static parts. By this we have, so to speak, 
freed ourselves from the non-linear parts of those terms in Einstein’s equations 
which contain second derivatives. 

We now pass on to examine the non-linear terms containing first derivatives. 
We cannot discard them straightaway because in the wave zone they decrease 
no more rapidly than 1/r? so that they may strongly influence the asymptotic 
form of the solution. But when evaluating them we can introduce the simplifica- 
tions that follow from equation (87.25) and from the fact that outside the 
differentiation signs the components of the metric tensor may be replaced by 
their limiting values. Raising and lowering indices by means of these limiting 
values we can write for instance 


kt = g*h,; bu = g bee ete. (87.31) 
just as if we were dealing with tensors. Using the rigorous relation 
uy 
: as (87.32) 


te Fgh ba, 
we can then write 


k, ke 
=> by; * — — by 87.33 
Ya % by 3, y 2¢ v ( ) 
The quantities (87.03) and (87.04) will then be given by 
yT#.08 = a (kabue + Kobe — heb) (87.34) 
c 
1 
and es (k,b% + kgbx — k'b,9) (87.35) 
It follows from (87.22) and (87.26) that 
kIT, =0; yy’ =0 (87.36) 
Therefore, in the wave zone the Lagrangian has the value zero 
L=0 (87.37) 


just as for electromagnetic waves. 
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If we form products of the three terms in (87.34) with the three terms of 
(87.35) we get nine sums, of which, however, only one differs from zero, namely 


1 
TT °°]T2, = ie kek'bb,, (87.38) 
Inserting the expressions obtained into (87.01) and replacing the factor (— g/c?) 
by unity, we get 


New = ke (1, = i668) (87.39) 


Here the following properties of N¥’ are important. In the first place it is 
proportional to the product kk’ and when it is inserted into Einstein’s equation 
(87.06) it can give rise only to such terms in g"¥ and g"’, and also in b*’ and 
b’, which are also proportional to this product. But if one augments 5" by 
terms proportiona] to k"k’, i.e. if one makes the substitution 


b28 -> b28 + akeke (87.40) 


then, by virtue of the harmonic conditions (87.26) and of (87.22), there is no 
change in N*’. As a consequence the problem of determining the g"’, including 
the effects of non-linear terms, reduces to the solution of linear equations. 
We put 
1 

7 = say (baad ~ sbe68) (87.41) 
As will be seen below, this quantity plays the role of the energy density of 
gravitational waves. Equation (87.39) then appears as 


Sry 
Nw = oe ooh (87.42) 
Inserting this equation into Einstein’s equations (87.06) we obtain 
1 8rry 
5,18" > ae + ogk"k’) (89.43) 


In forming the mass tensor in earlier sections we entirely neglected electro- 
magnetic energy. But we may now take into account the energy of the electro- 
magnetic radiation of a system of bodies. Denoting its density by Oem 80 that 


1 
Cem = oat + H?) (87.44) 


we get for the electromagnetic energy tensor in the wave zone the expression 
Te = Cemkt kh’ (87.45) 


(We are retaining the symbol 7” for this tensor because in the wave zone the 
entire mass tensor reduces to (87.45), the part corresponding to the material 
medium being zero.) 
Thus we can write Einstein’s equations for the wave zone in the form 
Bry 


1 
5,18™ = oe ohh (87.46) 
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where o can be taken as the sum of c,,, and o, if we include electromagnetic 

tadiation, and simply as og if we deal with the purely gravitational problem. 

To solve (87.46) we split off the term in 6%” which is proportional to k+k’ 

writing bY as 

bY = heY + Aktky (87.47) 

As has already been noted, the expression (87.41) for o, does not change if 
b+’ is replaced by h#’, so that we have 


ar 
Cg = soe has — kath’) (87.48) 
The relations (87.27) and (87.28) also remain valid ; we may write 
k hey = (87.49) 


The quantities h" and h appearing in (87.47) can be subjected to the con- 
ditions 
One = (87.50) 
and 


16 
Cs —o (87.51) 


in which, according to the remark made above, we may. replace the complete 
wave operator L] by the “static” wave operator 1° (equation (87.09)). 

Then the h*’ satisfy a linear wave equation and by (87.17) we can require 
that they have the asymptotic form 


Sal ON Hy 
her = = f(t, m) (87.52) 


The coefficient 2y/c? has been introduced for convenience in estimating the 
order of magnitude of the h¥’; the spatial components f** will be of the order 
of the kinetic energy of the system. 

Inserting these values for h*’ into (87.48) we obtain an expression for a, 
that is inversely proportional to r?. The 7-dependence of the electromagnetic 
part of the energy density will be of the same form. Therefore the total energy 
density o in (87.51) will also be of the form 

oo(7; n) 


o= (87.53) 


2 


The gravitational part oo, of the function o, can be expressed in terms of the 
quantities f” by a relation analogous to (87.48) namely 


Fog = Saal as = 4 fz fg) (87.54) 


Inserting the expression (87.53) for o into (87.51) we get for h the equation 


16ry 


Oh = = oo(T, n) (87.55) 
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The right-hand side of this equation can be considered to be known. Thus not 
only the h“” but also h satisfies a linear equation. The asymptotic form of the 
solution of this equation will be 


jm TERE f alsym) d+ se (87.58) 
r 


Let us put 
Arce | o((t, n)dt = A&(z, n) (87.57) 


Te 


If this expression is multiplied by dQQ/4, where dQ is an element of solid angle 
it gives the flow of energy in the time t —'t9 into such a solid angle situated 
in the direction n. The result of inserting (87.57) into the previous equation 
may be stated in the form 


h= ““hlog r A&(t, n) + e(t, n)] (87.58) 


where the quantity ¢(t, nm) may be taken as of the same order as Aé(z, n) or 
alternatively it may be omitted, the corresponding terms in the equations for 
g“” being included in f+’. 

Inserting the values (87.52) and (87.58) for h*’ and A first into equation 
(87.47) for b*’ and then into equation (87.23) for g*’, we obtain the following 
symptotic expressions for the g"’ in the wave zone. 


1 4M 
00 — _ 
8 c + cr 


2Y roo 1 2Y 4; 
oa + (log rAd +) 
got = 2Y po + at nz (logr A@ + €) (87.59) 
er ctr 
git = —c3;% + 2Y rae + ay ning(log rA& + ¢) 
er cr 


Here the quantities f% and f are expressible in terms of the f¢* by equations 
analogous to (87.28) namely 


n Nin . 
fea = fie; foo — = fit (87.60) 
Let us compare the expression for the g#’ in the wave zone with the corres- 
ponding expressions for moderately large distances as obtained in Section 85. 
In doing this we can first of all neglect terms involving log r and arising from 
the outflow of energy. As for the remaining terms, we can write them in such 


a form that they go over into corresponding terms in equations (85.41), (85.46) 
and (85.48). To do this we put 


Delt) = [ paxca(dzy? (87.61) 
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and then replace ¢ by + and relate ft to Dy(r) by 


ik — qa) (87.62) 
Then the equations 
1 4yM ya? Dix) 


g° ae mae. oe ee 
c cer C 0x 0x, Or 
2y 8? Dax(t) 
gt = — ao (87.63) 
2y oe? 
a — — 084, + mL 


cat or 

go over in the wave zone into (87.59), apart from terms arising from the out- 
flow of energy, while for moderately large distances they give the main terms 
of equations (85.41), (85.46) and (85.48). At moderately large distances the 


logarithmic term in the expression (87.15) for t may be omitted, i.e. r* may 
be replaced by r. 


88. Some General Remarks on the Conservation Laws 


Gravitational energy plays a very special role in the theory of gravitation, 
which is quite different from the role of all other forms of energy. It does not 
enter explicitly into the energy tensor but is accounted for indirectly through 
the gravitational potentials. The presence of a gravitational field and of the 
energy related to it reveals itself, as we know, in a change of the properties of 
space and time. Gravitational energy can be separated out in the form of 
additional] terms in the energy tensor only in an artificial manner by fixing 
the coordinate system and reformulating the problem in such a way that the 
gravitational field is taken to be superimposed on a space-time of fixed pro- 
perties, just as is done in‘ Newtonian theory. The additional terms in the 
energy tensor that correspond to gravitational energy do not possess the 
property of covariance (i.e. they do not form a tensor). This is quite under- 
standable because of the lack of uniqueness in the choice of the space of fixed 
properties into which the gravitational field is embedded. On the other hand, 
the difficulties that arise in the question of determining gravitational energy 
uniquely can be related to the fact that it cannot. be localized. This property 
of gravitational energy shows itself physically in the fact that a gravitational 
field cannot be screened. The only way to avoid the action of a gravitational 
field is to remove oneself further from the masses producing it. This can be 
done with an isolated system of masses. The gravitational field produced by 
such a system can be regarded as a local non-uniformity in infinite Euclidean 
space (or in Galilean space-time). With this form of description it is possible 
(neglecting radiation, of which more will be said below) to form the ten integrals 
of Kinstein’s equations which correspond to the ten classical integrals of the 
equations of mechanics. Four of these-- the integrals of energy and momentum 
—-form a four-dimensional vector in the Galilean space-time in which the system 
of masses with its gravitational ficld is embedded. The remaining six integrals 
form an anti-symmetric tensor in the same space-time; they are the integrals of 
angular momentum and of the motion of the mass centre of the system. _ 

It is important to note that the existence of the ten integrals of motion is 
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related to the uniformity and isotropy of the Galilean space-time in which 
the system is embedded. If one gives up the requirement of uniformity and of 
Euclidean character at infinity this brings about a violation of all or some of 
the conservation laws of which the ten integrals of motion are the expression. 
From a physical point of view this is perfectly natural, for the isotropy and the 
Euclidean character at infinity express the fact that the system is isolated, 
and one can expect the conservation laws to hold only if the system is isolated. 

One should note another reason why a system of moving masses can never 
be completely isolated in an active sense (i.e. in the sense of giving out energy 
rather than receiving it). This reason is provided by the radiation of electro- 
magnetic and of gravitational waves by the system and, possibly, also of other 
kinds of waves. However, for a system like the Solar system the loss of energy 
by radiation, even over geological periods, is very small in comparison with the 
available store of energy. This is so in spite of the fact that in absolute measure 
the loss appears an impressive one (for the Sun the radiative power corresponds 
to a transmutation of four million tons of matter into radiation per second). 
As for the radiation of gravitational waves, it is completely negligible : a rough 
estimate using equations based on the results of the preceding section shows 
that the power radiated by the Solar system in the form of gravitational 
waves is 103 or 10% times less than that radiated electromagnetically and is, 
all in all, about a single kilowatt. (Such an estimate will be made in Section 90.) 
Therefore it is permissible to neglect the action of gravitational waves in all 
but, possibly, purely theoretical considerations. This result shows in particular 
that the problem studied in Chapter VI of the motion of systems of gravitating 
masses may be considered as a problem in mechanics, disregarding radiation, 
not only in the approximation in which it was solved there but also in further 
approximations, which otherwise present no practical interest. 


89. Formulation of the Conservation Laws 
In a theory working with Galilean space-time the classical conservation 


laws can be stated in differential form, namely in the form of relations (31.10). 
The generalization of these relations is 


Vv, 7" = 0 (89.01) 


and, according to (41.25), this can also be written as 


1 7) 
Veg oe, V9 P+ TT = 0 (89.02) 


However the relations (89.01) do not by themselves lead to conservation laws. 
The mathematical reason for this is the presence of the second term, standing 
outside the sign of differentiation, owing to which, one cannot, in general, 
conclude that any quantities of the form of volume integrals are constant. 
(See Section 49.) The physical reason is the fact that the gravitational field 
itself possesses energy which does not enter J explicitly, but which must, 
nevertheless, be included in the general balance. 
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To obtain a relation in which gravitational energy is explicitly included we 
consider Einstein’s gravitational equations as stated at the beginning of 
Section 87. : 


We have 


=) 1 
(=2) 3 ae as emer vets ore (89.03) 


c2 


where N+” is determined by equations (87.01) to (87.05). On the other hand 
we have, in a harmonic coordinate system, , 


2 B2quy Cat aqS 
@B HY _ qt bv) — a 8 ae 
Bn, Be, (emg — g*¥g") = g Bo, i, Ou, On, (89.04) 


Multiplying this equation by 1/2c? and adding it to the previous one we get 


g 1 o? 
(- A (Re Pa 49""R) + 5e8 ray Or (g*® gt’ = g*" 9%) — rT’ (89.05) 
a lB 
where 
1 age ag* 
ES NO (89.06) 
2c? ax, Oa, 


Equation (89.05) would be an identity in an arbitrary coordinate system if 
IL’ was understood to mean 


1 dg*" ag = ag*® AghY = Agt# <2") 
Te — Nw 4 [ EE (ee 89.07 
+ sal OX, OL, OX, Ox, OX, Ag ( ) 
This relation goes over into the preceding one if the coordinate system is 
harmonic. To prove equation (89.05) one has to use the complete expression 
(D.87) for the Einstein tensor which is derived in Appendix D. 


Using Einstein’s equations 


RY —4g/R = — — Tuy (89.08) 
and putting 
2 
ye — ( = §) qe 4 2 Tus (89.09) 
c 8rry 


we can write equation (89.05) in the form 


o2 
0x, OL, 


(gt gt — tg?) = 16myU (89.10) 


Here we have on the left an expression of the form of the Krutkov tensor 
discussed in Section 31; the sum of the derivatives with respect to x, of the 
left-hand side of (89.10) is identically zero. We therefore have 


ave 


—0 (89.11) 


i 
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The set of quantities U"” is not a generally covariant tensor. It is a tensor 
only with respect to linear transformations ; in particular U*’ is a tensor in a 
harmonic coordinate system. Adopting the somewhat artificial point of view 
that was described at the beginning of the preceding section the second term 
in (89.09) multiplied by c? can be interpreted as the energy tensor of the gravita- 
tional field and the first term multiplied by c? as the energy tensor of the material 
media and all fields other than gravitational. If other than harmonic systems 
of coordinates are also admitted such an interpretation will not be completely 
unique if the region of the mass system is considered. But at large distances 
from the masses, where space-time is nearly pseudo-Euclidean and the co- 
ordinates are, by choice, Galilean, the physical meaning of the U¥’ becomes 
unique in any case. 

The conservation laws in integral form which follow from (89.11) are also 
obtained uniquely and do not depend on the arbitrariness connected with the 
deviation of the coordinate system from the harmonic within the region of 
masses. As we shall see, this is due to the fact that the volume integrals 
expressing the energy, the momentum and other quantities may be trans- 
formed into integrals over surfaces surrounding the masses. 

We now go on to derive the integral form of the conservation laws. To do 
this we multiply the left-hand side of (89.11) by the Euclidean volume element 


dx, dz, dx, = (dz) (89.12) 


and integrate over some sufficiently large volume, including the system of 
masses. For the time being we leave the dimensions of the region of integration 
undetermined. 

Using the Gauss-Ostrogradsky theorem we obtain equations of the form 


i Uda)? = — | nU% dS (89.13) 
and 
s | Ue(dz)3 = — | neU® dS (89.14) 


where 7; is the unit vector normal to the surface. If the boundary of the volume 
of integration is a sphere, we may put 


; dS =rde (89.15) 


where dw is the element of solid angle. 
As a consequence of the symmetry of the U4” in the indices u and v equations 


(89.11) also give rise to relations analogous to (31.06) and (31.07) which give, 
after integration 


d 
| eum — 2£,U")(dz) = — | nao — 2,Uh)dS (89.16) 
and 


d 
5 | (aU — U0%y(dxys = — | ny(aU! — 1Ui)ds (39.17) 
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We discuss the physical significance of the volume integrals on the left- 
hand sides of these equations. We putt 


Mae | Uda) (89.18) 

i J U (da) (89.19) 

Mi — @ J (ed — m0 (dey (89.20) 

and Mo = 2 | (aU — 1U%)(dex)3 (89.21) 


(We have written asterisks above the quantities (89.18) to (89.21) so that they 
should not be confused with the constants introduced when solving the equations 


* 
of mechanics.) The quantity M is the total mass of the system, including 
x 
the mass pertaining to the field enclosed within the volume chosen. The P# 
* 
represent the momentum and the M‘* the angular momentum of the system. 


* 
The MM? can be written in the form 
* Co 3 
MM? = MX* — PY (89.22) 
* 
where the X# are the mass centre coordinates of the system. It is therefore 


< 
clear that the M?° are the quantities involved in the law of mass centre motion. 

The value of the volume integrals (89.18) to (89.21) may change somewhat 
with the size of the region of integration. This is so because the field also pos- 
sesses ‘energy, momentum, etc. However it will be made clear in Section 90 
that with an appropriate choice of the region of integration the resultmg 
indeterminateness in the value of the integrals is negligible in comparison with 
their total value. 

We show now that not only the time derivatives of the quantities (89.18) to 
(89.21), but the quantities themselves may be represented as surface integrals. 
To demonstrate this we consider the structure of the expression (89.10) for 
U+’, Putting u = v = 0 we get 


16ryU © = 


2 
0X, OLy 
For « = 8 = 0 and also for a = 0, B = i (wherc + = 1, 2, 3) the expression on 
the right becomes zero. Therefore the indiccs « and £ in (89.23) actually take 
on only spatial values and we can write 


a2 


(g7? go = g7°g®°) (89.23) 


Gry ee (89.24) 
Dis OF 
whence 
¥ ie é tk (0,4 Q95 
M= | ng —— (gig — gq?) (89.25) 
Ory Ok 
Similarly we have 
Gry" = = ——— (gq — 9799") (89.26) 
COL, Vj 


+ We write 31%, Pi and X¢ with upper indices. 
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where j takes on only spatial values, but the values of « include zero. (We 
note.that the value « =7 actually drops out.) Insertion of (89.26) into the 
volume integral (89.19) gives 


* c2 


t — 
P 167y 


[ mg (arat — gree" 4s (89.27) 
Cx, 
From (89.26) it is easy to derive the relation 
16ry(2,0% — x,U%) = é ante ia (927 go" os g7°g?*) 
Oxj\ OL, 


— tr ao (9% g% as g% git) + gi* 9% — aioe} (89.28) 
OL, 
the right-hand side of which represents a sum of derivatives with respect to 
space coordinates. Inserting (89.28) into (89.20) and using the Gauss-Ostro- 
gradsky theorem we obtain an expression for the angular momentum in the 
form of a surface integral : 


c2 


* a 
tk — afp_ok __ 420 gik 
M i6 fmf or, (gq g*°q/*) 


— ax A (ater — gra") + gia — gitgtt|aS (69.29) 
L, 


a 


* 
Finally, the first term in the expression (89.22) for 1‘ may be written as 


2 
Mx ee [nr rye (gig — gigt) + gigto — argr|as (89.30) 
16ry Ox 

Thus all the quantities (89.18) to (89.21) have been represented as surface 
integrals. The value of these integrals depends only on the behaviour of the 
gravitational potentials at large distances. 

When writing down the conservation laws in differential form we have 
used the symmetry of the set} of quantities U"’ defined by (89.09) ; as we have 
indicated these quantities represent an analogue of a contravariant symmetric 
tensor. In the literature it is, however, more usual, since the earliest papers 
of Einstein, to employ a different set of quantities which are the analogue of a 
mixed (non-symmetrical) tensor and which are defined as follows. + 

At the end of Section 60 we derived an equation for the variation of the 
action integral, according to which 


8 [ LV(—g)-(dz) = { (Re? — 49°R) Bg,¢,/(—9)-(dz) (89.31) 


+ A different, but also symmetric set of quantities is used in the book a Landau and 
Lifshitz, The Classical Theory of Fields [23]. 


t The reader not specially interested in comparing ou. form of the conservation laws 
with Einstein’s may skip the rather tedious calculations from here to the end of Section 89. 
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Here L is the Lagrangian (60.23) and (dx) stands for the product of four 
differentials 
(dx) = da, dx, dz, dx, (89.32) 
Since 
8908 = = Jay Ievd9” (89.33) 
equation (89.31) may also be written as 
8 { LV(—9)- (de) = — f (Ry — B9,,R) 89V/(—9)-(d2) (89.34) 


Putting 
6] 
w  @ (89.35) 


we can define the “ partial derivative” with respect to g#” and to g#” by the 
equation 


[Ly (—9)] = ee tae + oe 


Insertion of (89.36) into (89.34) and integration by parts shows that the con- 
servative tensor 


Sgt (89.36) 


Gy = F,, — 29, (89.37) 
multiplied by +/(— g) may be written as 
og _ 8 AlW/(-9))  ALV/(—9)] 
v/( 9) Guy = on, age” = age (89.38) 


We now introduce a set of quantities w° defined by the equation 
«will (—9)1 
2/(— 9) v3 = 95 age oe ss [LvV/(— 9)] (89.39) 
Forming the sum of derivatives of (89.39) with respect to the z, and using 
(89.38) we get 


— . LV 
i Ala Dc (89.40) 
V(—9) ox, 2 ox, _ 
But for any symmetric tensor we have 
7h 
re Gi= — ie qu = —Te@y (89.41) 
'p 
Therefore the preceding equation may ee written as 
= 4 a = —-he (89.42) 
mes Le 


On the other hand, the divergence of the conservative tensor (which vanishes - 
identically) has the form : 
vg 2 AVE 9) 9°] _ pug — 0 (89.43) 
ve 9) ae 
Here the second term coincides with the right-hand side of (89.42). Eliminating 
it from (89.42) and (89.43) we get 


= tVi— 9-65 + V—9)-0g] = 0 (89.44) 
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We now express G° in terms of T? by the gravitational equations and put 


Then we can write the preceding equation as 
7) 
a VED Vel = (89.46) 


This equation represents the differential form of the conservation law as it is 
most frequently stated in the literature. The quantity 


Ue = ue (Te + 12) (89.47) 


corresponds to our U+’, but it is not symmetric in its indices. Therefore it is 
not possible to derive from the relation 
U 
fa] a 

eF=0 (89.48) 
0x, 


equations corresponding to the conservation laws for angular momentum and 
for mass centre motion. As for the conservation laws for energy and momen- 
tum, both formulations, based on (89.11) and on (89.48) respectively, give equiva- 
lent results. We shall demonstrate this, with the omission of some fairly com- 
plicated manipulation 

It follows from the definition (89.39) of the wS that 


dgx® 
2ue = —Ts, o — 88h + Vee = Ay —T)y, (89.49) 
Ly ar 
Hence, using the equations of Appendix D, we obtain 
0 g 7) 
9 —aq)-(G2 w?) = —— eT Of qo%q_th __ qotaeB 89.50 
V9 G3 + 1) = [HE (ara — orat)] (89.50 
and after multiplication by —c: 
7) cg 0 
16 Ue = = eT ot,aB _ q3%q78 
mils = =| Pe carat — a0] (89.51) 
This equation is analogous to (89.10). We obtain from it 
Dowd 3 ¥ ae d xj Q OT 20 JT 5? 
ja Mae oo, ee ae 8 (89.52) 


Since.the integration is over a aan ee we can take the limiting value 


of g,./ 4/(—g) outside the integral. Comparing the expression so obtained with 
(89.52) and (89.27) we may write 


9. *K 
[0 U%\( dx)? = (4. ] P= (89.53) 
. x V/(—9)} © 


where P® = M and pt has the value (89.27). The last equation can also be 
written in the form 


{ Usa = (oe Ses =). fom U=(dax)3 (89.54) 
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These relations confirm the fact that in spite of the difference in the differential 
forms of the conservation laws, the corresponding integral laws are equivalent. 
In addition, the presence of the quantities (9pz)oo in the relations (89.54) shows 
clearly that the total energy and total momentum of the system form a four- 
dimensional vector in the Galilean space-time in which the system is embedded. 


90. The Emission of Gravitational Waves and its Role in the Energy 
Balance 

In Section 89 we stated the conservation laws for energy and other quantities 
as equations expressing the balance of the quantity in question, i.e. expressing 
the fact that a change in the total amount of the quantity enclosed in some 
volume occurs only on account of the flux of the quantity across the surface 
bounding the volume. 

We now examine the question to what extent the flux through the surface ° 
may be neglected and the quantity in question may be considered constant. 
In other words, we enquire to what extent one can speak of conservation laws 
in the narrower sense. We shall confine these considerations to the conservation 
laws for energy and momentum. 

We use the definitions (89.18) and (89.19) for mass and momentum and 
take the surface of integration to be spherical. Then, according to (89.13) to 
(89.15), we can write : 


ar =—c | npr? day (90.01) 
‘and 

dPi 

> ees ce [ never dw (90.02) 

¢ 


the integration being over solid angle. We can take the surface of integration 
to be so distant that it lies entirely in the wave zone. Using the results of 
Section 87 it is simple to see that there the quantities L*’ defined by (89.06) 
or (89.07) reduce to the N#’. Taking for the N*” the values (87.42) and for the 
T” the values (87.45) corresponding to electromagnetic radiation, we may put 


Ue = okt ky (90.03) 
where o is the (electromagnetic and gravitational) energy density introduced 
in Section 87. Using equation (87.53) which gives the dependence of the energy 
density ¢ on the distance r, we can also write 


Ue = ee 1) ep (90.04) 
T 
Taking the value of k’ from (87.21) we have 
[ok — otet Qk Uik — ate) Nidk (90.05) 
er? a 
and therefore 
np = oat ae pl = oa, wv 4 (90.06) 


32 cr 2 
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Inserting these values into (90.01) and (90.02) we get 


_* 
ne | lao (90.07) 
dt c 
and @ 
4 
ie = -| NG (7, n)dw (90.08) 


But the density cp is an even function of the ni, both for the electromagnetic 
and the gravitational field. Therefore equation (90.08) gives 
* 


dPt 


ae 6 90.09 
77 ( ) 


As for equation (90.07), the integral in it will not be zero because ay is a positive 
quantity and it does not tend to zero for increasing r. Therefore an outward 
flux of mass will always take place. 

We consider first that part of this flow of mass which arises from the emission. 
of gravitational waves. To do this we must replace o, by the expression o,, 
given in (87.54), into which we must insert the values of f** from (87.60) and 
(87.62). For brevity we put 


a3 
A= ae Dit) (90.10) 
25 
Equation (87.54) then gives 
Coy = ae — 2A,A;y + Ande —4(A — Ay)?} (90.11) 


where we have introduced the short notation 
Ai => Aipny ; A= Amn = Aininy (90.12) 


It may be shown that Ses depends not on all six A, but only on five combina- 
tions of them, namely on the quantities 


Bu = An —Bn; a= 4fAy (90.13) 
which are subject to the relation 
Bi + Boo Ao Bas — 0 (90.14) 


The By, are the third derivatives of the quadrupole moments 
% : 
Dix(t) = | o (xiv — 38txr?)(dax)? (90.15) 


fort = +. Expressing the Aj.in terms of the By and inserting first into (90.12) 


and.then into (90.11) we can verify that the quantity a drops out of these 
equations and that, 


1 (BuBie — 2B,By + 4B) (90.16) 


o 
9 68x 


where the quantities B; and B should be expressed in terms of the Bix by means 
of formulae similar to (90.12). We cheek that the quantity Ogg 1s always posi- 
tive. Since it is invariant under three-dimensional rotations, it is sulficient to 
_make the check for any fixed direction of the vector n that enters the 
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expressions for B; and B. Putting * 


nm =1; n,=0; n,=0 (90.17) 
and using (90.14) we get from (90.16) 
mi 
So, = Fregit2( Boe — B,,)? + 2B} (90.18) 


which is indeed positive. 
Equation (90.07) involves the integral of the expression (90.16) taken over 
all solid angles. To evaluate it we can use the relations 


1 
| rime = 13% (90.19) 
and 
1 
Fa | teem deo = PBua8im + 3udem + Binds) ——_(90.20) 
which are most simply obtained by use of the identity 
1 1 
al (aini)?? da = rane (a2 + a2 + a2)? (90.21) 


Taking first p = 1 and then p = 2 and examining the coefficients of the powers 
and products of the a; we immediately get the required results. The integration 
in (90.21) is readily performed using a polar coordinate system with its axis 
in the direction of the vector a. 

Using the relations (90.19) and (90.20) we get 


=| (BixrBix — 2B,B; + 4B?) dw = 2Bi-Bix (90.22) 
TT 
Inserting (90.16) into (90.07) and using (90.22) we get 
* 
dM Y 
— = — — BB 90.23 
a Bor tk tk ( ) 


This expression gives us the rate at which mass flows out of the system as a 
result, of gravitational radiation. The corresponding equation for the rate of 
energy loss is obtained by multiplying (90.23) by c? and has the form 


* 
dw Y 
— = ——- ByB 90.24 

Ai Bes DiePak ( ) 

This loss of mass and of energy is completely negligible owing to the enormous 

value of the constant 


5c3 
— = 2 x 10% g/sec (90.25) 
Y 


If B characterizes the order of magnitude of the Biz, then taking the system 
formed. by the Sun and Jupiter we can put, in round figures 


= 1014 g/sec (90.26) 
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since the mass of Jupiter, its angular velocity of revolution around the Sun 
and the ratio of the square of its orbital speed to c? have the values 


uv ‘ 
M, = 2-10%g; w, = 2+ 10-8 sec; a =2-10° (90.27) 


Dividing the square of the number (90.26) by the value of the constant (90.25) 
we get a loss of mass of 5-10-!? g/sec. Translated into energy units this corre- 
sponds to the ridiculously small power of 450 W. For comparison we mention 
that the power of Solar electromagnetic radiation is about 4-10!? g/sec.; which 
is about 10*4 times greater. 

This estimate completely confirms the conclusion stated at the end of Section 
88 that in the problem of the gravitational interaction of heavy bodies gravita- 
tional waves play no part at all. 

Let us further examine the question with what accuracy a system of gravita- 
ting masses may be considered conservative if electromagnetic radiation is 
neglected. 

When we solved the problem of mechanics in Chapter VI we deduced the 
equations of motion to an accuracy that allowed us to find corrections to the 
energy of order Mg'/c? where g was some characteristic speed. Hquation 
(90.24) shows that apart from electromagnetic radiation we could have gone 
further, up to terms of order Mg*/c*. This is the accuracy to which the many 
body problem can be formulated as a problem of mechanics, with its ten 
classical integrals. We recall that in the problem of interacting charges (Sections 
26 to 28) the greatest attainable accuracy corresponds to the order Mq'/c?, 
because there radiation plays a much more important nart. 


91. The Connection between the Conservation Laws for the Field and 
the Integrals of Mechanics 


_ In Section 89 we derived equations for the time derivatives of the quantities 
x Ok * * 

M,.P!, M‘*, M® and found a representation of the quantities in the form 
of surface integrals. The expressions found for the total mass and for the 
momentum have the form 


ne Oi gtk {0 9k 
= Ba 00 _- 04k0 
6 | oe (g'*g q’g*)dS (91.01) 
and 
pi c? ra] ‘ 
= 4 =< aj q_0t _ _pa2d0nit 
16 ic 2a, (g%7g 9 gi*)dS (91.02) 


The angular momentum is given by 


c2 


- g 
Mik — aje0k _ qx0 ask 
was | ae, (9 gr’ — 9™g"") 


7) . ; 
— ae a (g% a" — gait) + gi*g% — gig ids (91.03) 
. a 
The quantity M® involved in the equation of mass centre motion is given by 


eo * Ox * 
M®=MXt— py (91.04) 
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* 
where P# has the value (91.02) and the first term is given by 


* * c2 


a 
MXit = . jk 00 _ pl0,ko 40,10 __ aft ,00 
sem | ma ia, (8 ggg) a aM eg! = 9g | ds (91.05) 


According to how distant the surface of integration S is chosen to be, the 
values of these integrals for the total mass, and the other quantities will be 
somewhat different because they will include a greater or lesser fraction of the 
mass, etc., pertaining to the “ pure”’ gravitational and electromagnetic radia- 
tion. As we have demonstrated in the preceding section, the energy of gravita- 
tional radiation is completely negligible. If we also neglect electromagnetic 
radiation we can place the surface of integration not in the wave zone, but 
instead at “‘ moderately large” distances from the system of masses; by 
doing this we, so to speak, cut off the mass and energy of the material media 
and the static fields, from the mass and energy of pure radiation. 

In Sections 84 and 85 we derived approximate expressions for the gravitational 
potentials at moderately large distances from the bodies ; these we shall now 
use. According to (85.46) the expression for g°° has the form 


1 4yM | 4yz, a 2y a 2y { 
00 j j me = at dz)? 
c 7 er a c3r3 ecu Ox, dx, Or * Ax, Ax, Ax, 3c°r par, (de) 
Ty?M? 14y? MP Xa, 4 aw (91.06) 
cr? cert ce of? 
The mixed components are given by (85.41) as 
4y 2yrj My 2 2y o? 
Oe Oh pt — D; Lt  opanrar (dx) 
= er - c3r3 Ox; Ot c8r a Ox; Oxy CFF priate (dz) 
Ty?MPt  y°MPtaay, | 4 OW, (91.07) 
ert i ee , 
Finally, (85.48) gives the spatial components as 
tk & a D 
Be a os eee 
g hk+5 a cap 
a? af 3 Pare 
— —~- = LVe 1 LjoKV — Lixxv,;)(dx)>? + —_— (91.08) 
Ox, Ot c*r as i cr* 


In the equations (91.06) to (91.08) we have written X‘ and P? instead of X; and 
P; in order to stress the vector character of these quantities. The value of the 


quantity Dj is given by (85.39) ; it Is 
1 7 
Dyi(t) = | co] + riche + II — 10)| (da)§ + Ie2 ay | eW(dz)§ 
(91.09) 


When inserting Dj, into equation (91.08) for g?* it is sufficient to retain the 
main terms which are just the usual moments of inertia. The argument of 
Dy is the time t, and not t — r*/c as in the wave zone. 
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The expressions for g™ and g” also contain terms involving W and W, which 
represent retardation corrections. These terms are approximately given by 


4 ow «(2 = ee ip ete Fa | exec dey 


ea o\r “7 | dt? 3c day Oxy Oxy dt? 
(91.10) 
and 
4 ow, YY % BPD; Y oyk LjLE d? 3 
oor Se ae ae aa ae ere ee 


The expressions for the g*’ just stated contain the quantities M, P!, M‘* 
and MX‘ — Pt the constancy of which is the expression of the mechanical 
conservation laws. We may therefore expect these quantities to coincide with 


* Ed * 
the values of the integrals M, Pi, Mt* and M?° which represent the total mass, 
momentum, etc., of the whole system, including the field. We now verify this. 
We have to insert the expressions (91.06) to (91.08) for the g“” into the surface 
integrals (91.01) to (91.05) and to evaluate them approximately. In this calcula- 
tion we shall retain terms of order g?/c? relative to the main terms when cal- 


* * 
culating M and Pt, but for simplicity we shall deal only with the main terms in 


* * 
the evaluation of Mt* and M?°. 

Leaving out the retardation term, we see that in all first derivatives of the 
g*’ the terms that decrease most slowly with distance are those of order 1/r? 
which are all odd in the coordinates z,, 22, £3, except for the derivative ag**/ at 
which decreases as 1/r and is even in the coordinates. In general the retardation 
corrections decrease more slowly but the correction in 29%/éz, also decreases 
as I/r? and is also odd in 2, 2». £3. 

Bearing in mind these remarks we form the expression 


7) dg” Age 
—— (gtk 00 _ gidgko) — aia _— ada 
2a, (8 9 g°°g*?) = g oe g on, (91.12) 


* 
that enters equation (91.01) for M. The right-hand side is obtained from 
the left by replacing the summation over k by summation over «, including the 
value 0, and by using the harmonic conditions. Since it is sufficient for the 
evaluation to know the odd terms of order 1/r? we may replace by their limiting 
values all the g#” in (91.12) that are not differentiated. Doing this we get 
00 10 
. ag” 1 ag 


8g ag? 
fa 9 a = Perea 
re ear Be a (91.13) 


a 


9 


This approximation is still valid if the retardation corrections are included, 
bearing in mind their order of magnitude in terms of gle. 
Using equations (91.06) and (91.07) for g® and g?°, multiplying (91.13) by 


mi = %/r and summing over 7, we get the following expression for the integrand 
in (91.01) 


Sea, ea eet, Y re 
Oz; 6 at oe? tb Gaga (84 — Bruny) Diy (91.14) 
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It is easy to see that the second term on the right is proportional to a second 
order spherical harmonic sc that it gives zero on integration over the sphere. 


. * 
There remains only the first term, and the integral for M becomes 
* 
M=M (91.15) 
. . . . i 
Thus, in this approximation, the total mass M of the system including the 


field, is equal to the mechanical mass M. 


. . 7 
Similarly we can calculate the quantity P‘. We have approximately 


Aq aalt 6q% 1 éal@ 
ee ace (91.16) 
Ox on, Ox, c at 


i} 


and further 


ox c at 
dy ay @Dy x d® 
apa — a, ga + Ga Ow — Sem) Fe | erecer(da)? — (91.17) 


Here terms of the order q?/c? as compared with the main terms have been 
retained and in them the expansion in inverse powers of r has been taken up 
to terms in 1/r?. In forming the expression (91.17) both terms of the retardation 
correction were included. 
On the right of (91.17) the second and third terms are proportional to spherical 

harmonics and give zero on integration. The first term gives 

* 

Pt= Pt (91.18) 
Thus the total momentum also is equal to the mechanical momentum. 


* 

We now pass on to the evaluation of M**. In the approximation stated we 
have 

ny( gi* g° — gi*go) = —e(ngg™ — nig?*) (91.19) 


We use equation (91.16) and the obvious identity 
NX, = NX (91.20) 


* 
and can then write the integral (91.03) for M**, as 


ik — Ok __ oe eS pp a 
cases ( | fen ( 9 1m, ct) at 


A ag%# 1 Za 
ee (nm ae ma C1 


To evaluate the integrals containing a@g/*/@ we would need to know g’™, 
including terms decreasing as 1/r* (we have in mind terms in g/* containing 
c? in the denominator). But in Section 84 we have determined g/* only up to 
terms decreasing as 1/r?. Although the evaluation of the missing terms does 
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not present any difficulties in principle, we can avoid it, if in (91.21) we confine 
ourselves to the main terms and observe that the integrals containing 0g**/ dt 
contribute only corrections of order q?/c?. The integrand in (91.21) then 
becomes 


dg” dg% 
ono — 2 s) — ons| cares 


4 8 9 
= “VY My i — (mPx — rePi) + = Bes — 3ngny)(My + Dy) 
2 
=. sy (343 — 3nany)(Myx + Dy) (91.22) 


Here the right-hand side is expressed in terms of spherical harmonics and 
involves harmonics of zero, first and second orderst. On integration only the 
first term remains and we get 


Mk = My (91.23) 


as was to be expected. 
It remains to evaluate (91.05). Using (91.13), we can write this integral as 


OK c? 8g°° 1 2) 1 
Vs 00 _ ah Seay es aU Pee 
MX ss | fons se a eT 5 Malad + 084s) ((dar) 
(91.24) 


Since @9/°/2 can be expressed in terms of the 0g/*/@zx% by use of the harmonic 
conditions and the latter quantities may be calculated from (91.08) up to terms 
decreasing as 1/r> we could evaluate all terms in the integral. But to simplify 
calculations which are of no great interest we restrict ourselves again to the 
main terms. Then we can write 


dg 4yM_X* 8yM 4y 
on( ate — 215) = Age b(t + EE) — abu — Song 
(91.25) 
Hence, arguing as before, we obtain 
Ok 
MX* = MX* (91.26) 
* 
and since we have already verified that P! = Pi, we have 
* 
M® = MX* — Pt (91.27) 


as it should be. 

The calculations just performed can be thought of as a verification of the 
correctness of the approximate solution of Einstein’s equations given by the 
equations (91.06) to (91.08). At the same time the results of these calculations 
illustrate clearly the relation between the conservation laws stated in general 
form and the integrals of mechanics. 


} If terms of relative order g*/c? were taken into account the integrand in (91.21) would 
involve harmonics of up to fourth order. 
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92. The Uniqueness Theorem for the Wave Equation 
In the next section we shall deal with the problem whether for an isolated 
system harmonic coordinates are uniquely determined. In that investigation 


we shall need a theorem which establishes the condition of uniqueness for the 
solution of the wave equation 


Lip = 0 (92.01) 


We shall now prove this theorem for the wave equation with constant coefficients 
for which 


_ 1 a% ap ap ary 
3m (3 + aa =) eae) 
We subject the function 
b= (a, ¥, 2, t) (92.03) 


to the following conditions :-— 
(a) The condition of boundedness : for all x, y, z and t the inequality 


lv] <M, (92.04) 


shall hold so that the function is uniformly bounded ; 

(b) The condition of decrease at infinity: if the distance from the origin, 
r= (2? + y? + 2?) increases without limit the solution shall satisfy the 
inequality 

M 
pl <— (92.05) 


r 
for all. and its first derivatives shall satisfy 


M 


[grad b] << —; < = (92.06) 


Thus the function ¢ and its first derivatives are to decrease at infinity in inverse 
proportion to 7, or more rapidly. The quantities My, M and M, are positive 
constants. 

(c) The condition of outward radiation: for 7 —> oo and all values of & = 
t + r/c in an arbitrary fixed interval the limiting condition 


- frp) 1 aAry)\ 
im { _ +, "| =o (92.07) 


ro 


shall be satisfied. It expresses the fact that there are outgoing waves only, waves 
coming from outside being absent. 

We can now formulate the following uniqueness theorem. 

The solution of the homogeneous wave equation Lh) =0 which satisfies the 
conditions (a), (b) and (c) is identically zero. 

In the proof of this theorem we start by using Kirchhoff’s formula for the 
solution of the wave equation.t Kirchhoft’s formula expresses the values of 


{ For Kirchhoff’s formula and its generalizations, S. L. Sobolev’s formule, see V. 1. 
Smirnov, A Course of Higher Mathematics, Vol. II, Sec. 202, 1957, and Vol. 1V, Sec. 148, 1958. 
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the function ) at a point in space r, at time ¢, in terms of the values of p and 
its derivatives on a surface S surrounding this point, the value of ) on each 
point r of the surface being taken not at time t, but at the earlier time 


t=t)— : |r —ro| (92.08) 


Kirchhoff’s formula reads as follows : 
veo 2 [(elS] oni 2 Ells cm 


Here R denotes the distance 
R= |r —fo| (92.10) 


and dp/dv is the derivative in the direction of the outward normal to S: 


a a 
—_—= 7 cos (nx) + “ cos (ny) + = cos (”z) (92.11) 


The bracketed expression [] signifies that the argument ¢ must be replaced 
by its value (92.08), this replacement being made in the derivatives ap] dv 
and &)/ét after the differentiation. 

We take for S the surface of a sphere of radius R centred at ry. Then we 
can put 


dS = R? de (92.12) 


where dw is the element of solid angle. Further 


cos (nx) = ai al cos (ny) = Y — Ho. cos (nz) = 2 (92.13) 
R R 
and since age 1 92.14 
wT were) 
Iso write nec 

we can also wri aap (92.15) 

Then Kirchhofi’s formula becomes 

aE sale 1 [Rp] 

V(Lo; to y= 2 | ( 5 eter am da) (92.16) 


The expression in curly brackets under the integral sign is given by 


ol R 1loOR 


a 
iodo + (2 — 29) eee (92.17) 


where ¢ has the meaning (92.08). 
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According to (92.16) the value of the function at ry, t, is the mean over 
solid angles of (92.17). In order that | should be zero at Io, to it is evidently 
sufficient if for increasing R the expression (92.17) tends to zero: 


. {FRY} 1 [Ry] 
l a a = — 
aoe oR T © At Sr-to—cRIJo) 0 (92.18) 


For any point Yr, Temaining at a finite distance from the origin this requirement 
will certainly be satisfied if conditions (a), (b) and (c) hold, for (92.17) can be 
written in the form 


ARP] | 1 ERY] 


ep tor aetates (92.19) 
where em) be ty 2 fh (92.20) 
a= — («3 + Wes + tos) (92.21) 
and eee — ees (92.22) 
Since |R—r| <1, (92.23) 


the equations (92.06) of condition (b) give 


r r 
Jer] <M; |eo| <M, (92.24) 
If 7, is fixed and r > co these expressions tend to zero. Therefore we have 
separately o1 > 9; 2 > 0 (for r > 00) (92.25) 
and for (92.18) to hold it is sufficient that 
o>0 (for 7 > 00) (92.26) 
4) 1a 
or lim | ee a =i (92.27) 
rool or c at 


This condition, as well as (92.18), is to be satisfied for values of 7 and t such 
that when r — oo, the quantity t; = t+ R/c remains fixed. Now let us intro- 
duce the quantity #; = ¢ + r/c into the formulation of condition (c). We have 
Ifo — | < 7o/¢, and since 7, is also fixed, the quantities t) and 4% will both 
belong to some arbitrary fixed intervals if one of them belongs to such an 
interval. One can therefore replace ty by ¢4 in condition (92.27) which then 
becomes the condition of outward radiation (92.07). This latter is a sufficient 
condition for the limiting equation (92.18) to hold, whence ) = 0 fellows 
directly (see [43]). 

We have proved that the function ) is zero at the point rj, but since this 
point may be chosen arbitrarily this proves that ) vanishes everywhere. There- 
fore the uniqueness theorem formulated above is proved. 
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We now consider the solution of the inhomogeneous wave equation 


—_—-— = — 4no (92.28) 


expressed in the familiar form of a retarded potential 


Y(To. to) = { fal (92.29) 
1 
where [eo] = e(r, 4) ; t= lo =e |r —Fo| (92.30) 


The integration in (92.29) is over the coordinates (x, y, z) and extends over 
all that volume where the “ density ” is non-zero, i.e. over the region of the 
masses. If this region lies entirely within the sphere r < a and if the time 
t, satisfies the inequality ct) >r-+a@ the calculation of the integral (92.29) 
requires the values of p only for positive values of the argument t. 

At large distances from the masses the function ), which satisfies the homo- 
geneous equation outside the masses, has the asymptotic form 


U(r, t) = ru? = -, n) (92.31) 


where n is the unit vector in the direction of the radius vector r. The function 
y. is expressible in terms of p as follows :— 


aC © -, n| a { o(r’ = - ree Nav" (92.32) 


These equations show that the retarded potential satisfies the conditions 
(a), (b) and (c) formulated above. By virtue of the theorem just proved it is 
therefore the only solution of the inhomogeneous wave equation (92.28) that 
satisfies these conditions. Thus, the theorem gives a mathematical justification 
for the use of the retarded potential ; usually it is derived from physical con- 
siderations. 

The uniqueness theorem we have formulated has only been proved for the 
wave equation with constant coefficients. One should expect it to remain true 
also for such equations with variable coefficients qg“” which have the asymptotic 
behaviour defined in Section 87. The proof of the theorem for the general 
case presents considerable difficulties and is an unsolved mathematical problem. 
It is, however, much simpler to prove the theorem for the case that one can 
restrict oneself to the “stationary” approximation for the g¥’, and may 
write the wave equation in the form 


say AY= (92.33) 


where A is the Euclidean Laplace operator and the “ refractive index ” has 
the value 


n=1+— (92.34) 


Here U is the Newtonian potential, which for this problem may be considered 
to be independent of time. One could construct the proof either with the aid of 
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8. L. Sobolev’s formula} which is a generalization of Kirchhoff’s for equations 
of the form (92.33) or one could expand w, considered as a function of t,in a 
Fourier integral and use the well-known Sommerfeld radiation conditions, which 
in this case are consequences of our condition (92.27). 

In conclusion we remark that the statement of the problem adopted in this 
section differs from the more usual statement in terms of Cauchy’s problem in 
that we do not explicitly introduce initial conditions, but instead study such 
solutions of the wave equation which do not depend on the initial conditions 
within the region of space and the interval of time in question. This formulation 
is dictated by the physical nature of the problem. In it the solution is made 
unique and independent of the initial conditions by limiting the space-time 
region : any initial disturbance that may have occurred in this region is sup- 
posed to have dispersed long ago and further disturbances cannot enter the 
region because of the condition of outward radiation ; in the case of the inhomo- 
geneous equation they may be generated within the region, but then they are 
determined not by initial conditions but by the “density ” o. 


93. On the Uniqueness of the Harmonic Coordinate System 


When solving: Kinstein’s equations for an isolated system of masses we used 
harmonic coordinates and in this way obtained a perfectly unambiguous 
solution. We found unique results not only for finite and “ moderately large ” 
distances from the masses, when the wave-like, i.e. hyperbolic, character of 
Einstein’s equations was not essential and was accounted for by the introduc- 
tion of retardation corrections, but also for the ‘‘ wave zone”. This wowd 
lead us to expect that the harmonic conditions together with the requirement 
of Euclidean behaviour at infinity and with the condition ensuring uniqueness 
of a wave type solution should determine the coordinate system uniquely 
apart from a Lorentz transformation. 

We now consider this question in detail. If Lh) is the invariant d’Alembertian 


1 @ a 
Oy = —— — (o *) (93.01) 
Y= 7 g)aa, \8 ae, 
the harmonic conditions 
ogt’ 
=0 (93.02) 
Ox 


iL 
may be written in the form 


{Lv 
eee EG (93.03) 
V/(—9) e, 

so that each of the functions 


p=%; Y= PH; YH (93.04) 
is a solution of d’Alembert’s equation 
Lip = 0 (93.05) 


Tt Sobolev’s formula differs from Kirchhofi’s formula (92.09) essentially in that it involves 
not only surface integral but also a volume integral. Reasoning as described above, we 
can take the integral over the infinitely distant surface to vanish by virtue of the con- 
ditions on . This then gives not directly } = 0, but instead a homogeneous integral 
equation which has zero as its only solution. The quantity (9 = ¢ Ae He in the statement 
of condition (c) of the theorem is to be replaced by (* = t + r*/c with r* from (87.12). 
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Evidently this equation is also satisfied by any constant and any linear function 


of the harmonic coordinates. —— . 
We first consider the case that the metric of space-time is Galilean. In 


Galilean coordinates 
tp; C= 2; Lg=Y; Ly = 2 (93.06) 
the components of the metric tensor are} 
gy” =e, (93.07) 


and the d’Alembertian has the form 
rs) 
Oly =e 2M, (93.08) 


Evidently each of the Galilean coordinates is harmonic. 
We seek the most general form for harmonic coordinates. Let us assume 
transformation equations to new coordinates of the form 


Z., = f*(Lo, Fy, La, Ls) (93.09) 
The new coordinates, like the old, must satisfy d’Alembert’s equation, they 
must be Galilean at large distances and must lead to values of the g*” with the 


correct asymptotic behaviour. Remembering these requirements we shall 
write the functions f* in the form 


ft = a, + eg, 5% + (Xo, Xy, La, Ls) (93.10) 
Here the a, and the a,, are the coefficients of a Lorentz transformation and so 
satisfy the relations 

€,@,,,2,, = 6,9 €,2,, An, = €,9 (93.11) 


a a va poopy? aan av wy 


Since the functions f* must satisfy the equation 


Dift7= 0 (93.12) 
and the linear terms satisfy this equation separately, we must have 
Cyt = 0 (93.13) 


This equation must hold throughout space and at all ¢ (i.e. all x). Since the 
equations (93.13) involve second derivatives it is natural to require that the 
functions 7* themselves and their first derivatives should be bounded throughout 
space and at all ¢. This requirement is all the more necessary because the first 
derivatives of the new coordinates with respect to the old enter the transfor- 
mation equations for any tensor. As for the conditions at infinity, since there 
the new coordinates are to be Galilean, the functions 7% which represent additions 
to the Lorentz transformation, must tend to zero together with all their first 
derivatives. In addition, the requirements resulting from the asymptotic 
behaviour of the g"” must also be satisfied at infinity. We consider these require- 
ments in more detail. Without restricting generality we can assume that the 
Lorentz transformation occurring in (93.10) is the identity, so that that equation 
can be written as 


Si = t, + 1° (Zo, Ly, Lo, Ls) (93.14) 


t When using the summation convention for indices occurring twice the index of 
en (€o = 1; & = eg = es = —]1) is not to be counted. 
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Calculating the components of the metric tensor in the new coordinate system 
by the general rule : 


Ox! Ox; 
’aB — guy “a “"B 
g g aa, an, (93.15) 
P On? ant Ont dyP 
we get ge 8 +e 0 +e ae y 
raat) = = 93.16 
a * 0x, e aa, * Qn, Oa, ( ) 


By the condition at infinity the quantities @y*/@z, tend to zero. Therefore 
the asymptotic behaviour of the g’* is characterized by the terms linear in these 
quantities. But this asymptotic behaviour must be such that the difference 
between g*° and its limiting value at infinity is an outgoing wave. (Corres- 
ponding relations for the g*° were introduced in Section 87, and similar relations 
for the g*° follow from them.) Thus we must have at infinity 


gg — (g**),, = outgoing wave (93.17) 


This requirement must be satisfied in any harmonic coordinate system, so that 
the g’*° of equation (93.16) must also satisfy it. It follows therefore that 


On? On 
a Oa, + “Za, = outgoing wave (93.18) 
If we put 1 = en, (93.19) 
and divide through by e,e, we can write this condition in the form 
Ong On, a . 
ae, + Bi = outgoing wave (93.20) 
Using the identity 
a On, on, a v é fo Y v 
alse z)+-(2 + 58) — 2 (Fi + Se) = 9 On 
Gx,\0n, om, Ox,\ 0x, Ou, Cx, \ Ox, On, OX, Ly 
(93.21) 


and passing from the », to the y” we conclude that 
Ory" 

0x, Og 
Hence one can deduce that 7” is the sum of a linear function and of a function 
which at infinity represents an outgoing wave. The same conclusion evidently 
applies not only to the functions 7* entering (93.14) but also to the * entering 
(93.10). Since in (93.10) the linear function has already been split off and since 
we have the condition at infinity »* + 0, we arrive at the conclusion that the 
function 7" itself represents an outgoing wave : 


7 = outgoing wave (93.23) 


= outgomg wave (93.22) 


The conditions satisfied by the additional function 7 in (93.10) can now be 
made more precise. This additional function satisfies the wave equation (93.13) 
and reniains bounded everywhere together with its first derivatives. At infinity 
both 7% and its first derivatives decrease as l/r. Finally, from (93.23) we can 
infer that at infinity y* satisfies a condition of outward radiation of the form 
(92.27). All these conditions must be satisfied for all values of ¢. By virtue of 
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the uniqueness theorem proved in Section 92 a quantity 7* satisfying these 
conditions vanishes identically. Therefore the function f* that enters (93.09) 
and (93.10) reduces to a linear function and the most general form of trans- 
formation from one system of harmonic coordinates to another is 

,, = A, + CA, p%p (93.24) 
i.e. it amounts to a Lorentz transformation. 

We have proved this result for the case of Galilean space-time with the 
metric (93.07), because the uniqueness theorem formulated in Section 92 was 
proved for that case. However, it appears certain that this theorem can also 
be proved for the genera! case of an Einstein space-time whose metric tensor 
has the asymptotic behaviour established in Section 87. If we accept this we 
reach the conclusion that in the general case also the harmonic coordinate 
system is uniquely defined apart from a Lorentz transformation. Indeed, 
the reasoning of this section remains valid in the general case. In a harmonic 
coordinate system the d’Alembertian has the form 

sath SOY 

Oy =g9 on, Oe, (93.25) 
and does not involve any first derivatives. Consequently any linear function 
of the coordinates satisfies the wave equation. Therefore, if we write down a 
transformation of the coordinates in the form (93.09), where f* has the value 
(93.10) and if we require that the new coordinates should also be harmonic 
(equation (93.12)) we shall again obtain for the additional function the wave 
equation (93.13), but with C1 now denoting the operator in (93.25). The 
additions y* and their first derivatives must evidently be bounded throughout 
space. As for the remaining conditicns imposed on the 7%, they all refer to 
the infinitely distant regions of space, where the metric differs little from 
the Euclidean. Therefore these conditions may be safely taken over from the 
previous case. In particular, the left-hand side of (93.18) will represent the 
difference between the asymptotic expressions for the g** in initial and trans- 
formed coordinates and must, therefore, correspond to an outgoing wave. Hence 
we conclude as before that the relations (93.22) and (92.23) are valid and then 
the uniqueness theorem leads to the conclusion that the additions 4% vanish. 

Thus in the general case also, the harmonic coordinate system is uniquely 
determined apart from a Lorentz transformation. 

We should note that we have not explicitly introduced any initial conditions 
for the functions f* that define the coordinate transfermation. This statement 
of the problem corresponds to the statement of the problem of determining the 
gravitational potentials g“”. In that case there is sense in introducing initial 
conditions only for the distribution of the gravitating masses, but not for the 
gravitational waves and not for the gravitational potentials themselves. The 
problem of determining the gravitational potentials is not Cauchy’s problem, 
i.e, not a problem with initial conditions, but rather the problem of finding 
a steady state which sets in when all gravitational waves have dispersed, other 
than those produced by the motion of the masses under consideration. (See 
above, end of Section 92.) 

Our discussion shows that the conditions ensuring the uniqueness of the 
coordinate system arise directly from the formulation of the physical problem. 
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Since this is not Cauchy’s problem as regards the determination of the gravita- 
tional potentials it will not be Cauchy’s problem in relation to the determination 
of the coordinate system. The determinateness in the formulation of the 
problem of the gravitational potentials is achieved by excluding “ transient ” 
gravitational waves by the condition of outward radiation. But by the same 
means we exclude the corresponding wave terms in the equations for the co- 
ordinate transformations, and thus in turn we achieve the uniqueness of the 
coordinate system. 

The question of the uniqueness of the harmonic coordinate system in the non- 
stationary case may also be approached from another direction, namely from 
the approximate method for solving the gravitational equations used in 
Chapter VI. If one follows the calculations performed there one finds that at 
all stages the solution, and therefore the coordinate system also, are obtained 
uniquely. Thus the uniqueness of the harmonic coordinate system can also be 
taken as proved in all cases in which the method of Chapter VI is applicable. 

Let us formulate our conclusions once again. In the case of an isolated. system 
of masses there exists a coordinate system which 1s determined uniquely apart from 
a Lorentz transformation if suitable supplementary conditions are imposed. This 
is the harmonic coordinate system, characterized by a first order linear differential 
equation and by boundary conditions for the g*”. 

The question may arise: can any coordinate systems other than the har- 
monic, exist, defined apart from a Lorentz transformation! This seems to us: 
to be unlikely. The harmonic coordinates are characterized by the fact that 
each of them, and also any linear combination, satisfies a linear, generally 
covariant equation. One could hardly find other coordinates with this property; 
other coordinate conditions, even when very similar to the harmonic condition 
(93.02), for instance ég*’/éz, = 0 do not reduce to a linear equation. 

“The fact that for determinateness Einstein’s ten gravitational equations 
must be supplemented by four additional equations is in itself evident. The 
difficulty consists in choosing these supplementary equations and in formulating 
boundary conditions. 

In contrast with our statement of the problem, the formulation given to 1t by 
Einstein leaves it mathematically indeterminate and there can be no question 
of uniqueness of the solution. According to Einstein there are no privileged 
systems of coordinates and the system of coordinates remains indeterminate 
to the end. Einstein and those who support this point of view raise this indeter- 
minacy to a virtue and see in it a deep meaning, namely the expression of some 
“ general principle of relativity ”, on the strength of which Einstein’s theory 18 
called by its author the “General Theory of Relativity ”’. . 

We cannot agree at all with such a point of view. The indeterminateness 
existing in Einstein’s formulation is by no means a matter of principle and has 
no deep significance whatsoever. It arises in any physical theory that can be 
formulated in a generally covariant manner. The so-called special theory 
of relativity has actually been formulated in a generally covariant manner 
(Chapter IV of this book) and thus possesses, In this formulation, the property 
of indeterminateness to just the same degrce as the so-called general theory 
of relativity (the theory of gravitation). What really matters is not the Dos: 
sibility of an indeterminate formulation (which is trivial) but just the opposite, 


374 The Theory of Space Time and Gravitation 


namely, a formulation that is as determinate as is allowed by the nature of the 
problem. From this point of view, the existence of a preferred set of coordinates, 
determined apart from a Lorentz transformation, is by no means trivial, but 
reflects intrinsic properties of space-time. This is most clearly seen In the case 
of the “special” theory of relativity, where the existence of Galilean coordinates 
reflects the uniformity of space-time. 

If this is made clear, the obvious necessity of introducing additional equations 
in order to define privileged coordinate systems (and thus to achieve a more 
determinate formulation of the problem) cannot give rise to any serious objec- 
tions. This necessity exists in the theory of Hinsteinian as well as in that of 
Galilean space-time. The fact that in the latter case no such equations are 
usually written down does not alter anything, because such equations can be 
written down or alternatively may be replaced by the verbal statement that 
Galilean coordinates should be used. 

{t is obvious that the additional equations mentioned are not generally 
covariant (otherwise they would not restrict the coordinate system.) Thus the 
objection may be raised that the additional equations destroy the beauty of 
the theory which according to the current opinion, lies in its general covariance. 
But this argument provides no real objection, in the first place because these 
equations must be compatible with the generally covariant Einsteinian equations, 
the simple form of which constitutes the real beauty of his theory. It should 
be added that the simplicity and beauty of the theory is due not so much to 
its covariant form as to its idea of unifying metric and gravitation. 

The uniqueness of the solution achieved by the use of a well-defined privi- 
leged coordinate system seems to us an undoubted advantage rather than the 
contrary. This uniqueness shows the physical meaning of the privileged system, 
while the actual physical significance of any other coordinate system only 
becomes clear given the equations relating it to the privileged system. 

As for the so-called “General Principle of Relativity ” no such physical 
principle can exist, as was explained in detail in Section 49*. In contrast the 
Galilean relativity principle for uniform straight line motion, and its generali- 
zation expressed by the Lorentz transformations, are physical principles. 

With the assumption of a rigid metric (i.e. one that does not depend on the 
physical processes ovcurring in space and time) the Galileo—Lorentz relativity 
principle leads to uniform space-time and, conversely, is an expression of this 
uniformity. In the case of non-uniform space-time one can consider the exis- 
tence of harmonic coordinates to be a further generalization (or wider applica- 
tion) of the Galilean principle of relativity; this reasoning corresponds to the 
generalized definition of the principle of relativity put forward in Section 49*, 
as a statement about the equivalence of frames of reference within a certain 
class. As an illustration let us consider the expressions (91.06)-(91.08) for the 
g*’. If we assume that the constants involved in these expressions are fixed and 
perform a Lorentz transformation we shall change the form of the expression 
for the g*, because the space-time is not uniform. But if we also transform 
the constants (M, P*) and (M%, Mit) by the same Lorentz transformation we 
restore the form of the g“’. This just means that if a process (the motion of 
masses) is possible in which the constants of motion have some given values in 
one frame of reference, then a process is also possible in which these constants 
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have those values in another frame of reference; in other words it means that 
the principle of relativity is satisfied in the form that was given to it in 
Section 49*, 

Here we have. considered the approximate expressions for the g"” and only 
the constants of motion, not any other quantities characterizing a system of 
bodies. But our conclusions also remain in force for the exact solutions, because 
the gravitational equations, the harmonic conditions and also the boundary 
conditions are stiictly covariant with respect to Lorentz transformations. 
Thus the principle of relativity expressed by the Lorentz transformations can hold 
im non-uniform space also, but a general principle of relativity cannot be valid. 

We have stressed repeatedly the fundamental significance of the existence 
of a preferred coordinate system, defined apart from a Lorentz transformation. 
It reveals itself, among other facts, in the following. Only if the existence of 
such a coordinate system is recognized can one speak of the correctness of the 
heliocentric Copernican system in the same sense as this is possible in New- 
tonian mechanics. If the existence of the preferred coordinates is denied, one is 
led to the inadmissible point of view that the heliocentric Copernican system 
and the geocentric Ptolemaic system are equivalent. Such a point of view is 
counter to the definition of physical equivalence of frames of reference given in 
Section 49* and is therefore inadmissible. 

The above remarks concerning the privileged character of the harmonic 
system of coordinates should not be understood, in any.case, as some kind of 
prohibition of the use of other coordinate systems. Nothing is more alien to 
our point of view than such an interpretation. Our aim was to clarify the question 
of coordinate systems in the restricted statement of the problem of the theory 
of gravitation discussed in the preceding chapters, where an “ insular ” 
distribution of matter is assumed. The intention was to show that this question 
has the same answer as in the case of the Galilean space-time of the “ special ” 
theory of relativity. Nobody would assert that in the latter case the existence 
of Galilean coordinates implies a prohibition of the use of other coordinates. 
Likewise, in the case of the Theory of Gravitation, the existence of harmonic 
coordinates though a fact of primary theoretical and practical importance, does 
not in any way preclude the use of other, non-harmonic, coordinate systems. 
The harmonic coordinates in Einstein space are no more (though no less) 
privileged than Galilean coordinates in uniform Galilean space. 


94, Friedmann-Lobachevsky Space 

When we stated the problem of solving the gravitational equations for a 
system of masses, we postulated that this system should be isolated and. 
embedded in infinite Euclidean space or, more accurately, in Galilean space-time. 
Such a statement of the problem is undoubtedly admissible when considering 
objects of the dimensions of the Solar system. These dimensions may be 
characterized by the radii of the orbits of the heaviest planet, i.e. of Jupiter 
(778 million km). and of the most distant planet, i.e. Pluto (5900 million km). 
Light from the Sun reaches Pluto in about 5-5 hours. The nearest fixed star 
(« Centauri) is at a distance of 4-3 light years from the Sun, i.e. about 7,000 
times more distant than Pluto. 

It is possible that such a statement of the problem is still permissible when 
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considering a single star cluster or even the whole Galaxy. The dimensions 
of the galaxies} are still some tens of times smaller than their mutual distances. 


But if one considers regions of space so great as to include many galaxies— 
and such regions of space are observable with large modern telescopes—the 
notion of an isolated system embedded in Euclidean space evidently becomes 
i licable. 

The question arises with what to replace the Euclidean background which 
we have hitherto considered? What kind of space-time should one take to be 
the background from which the separate mass systems stand out? 

Another reason for the need to replace the Euclidean background by some 
other is that if one considers masses distributed throughout Euclidean space 
with a uniform mean density the well-known paradox of Seeliger arises which 
amounts to the fact that the Newtonian potential of a uniform mass distribution 
does not exist. At the same time astronomical observation shows that through- 
out the region of space accessible to observation, up to distances of the order of 
10° light years, the distribution of galaxies seems to be uniform on the average. 
On the other hand, if the Newtonian potential does not exist it is clear that the 
form of the solution of Einstein’s equations must be essentially different from 
that which corresponds to an isolated system of masses and which allows the 
gravitational potentials g*” to be approximately expressed in terms of the 
Newtonian potential. Owing to the altered character of the problem the choice 
of coordinates must also be made anew. 

The solution of Einstein’s equations which corresponds to an isotropic space 
with a uniform mass density and which may be taken as the background in 
the study of enormous distances that include many galaxies was obtained in 


1922 by the Russian scientist A. A. Friedmann [24]. As we shall see this solution 
permits the introduction of coordinates in which space possesses the properties 
of Lobachevsky’s geometry. We shall therefore call the corresponding space- 
time manifold the Friedmann-Lobachevsky space. 

The theory of Friedmann-Lobachevsky space may be derived starting from 
the assumption that the expression for ds? is invariant under the homogeneous 
Lorentz group. It may then always be reduced to the form 

ds? = H*(S)(da? — da? — dz? — dz?) (94.01) 
where 


S = Va — af — of — af) (94.02) 
Though this space-time is not itself Galilean, equation (94.01) shows that it 
may nevertheless be mapped conformally on a Galilean space-time. The 
group of homogeneous Lorentz transformations exists in this ‘space and this 
ensures the tsotropy of the space. In particular, the origin of the spatial coordin- 
ates is in no way singled out from any other point in the space ; any fixed point 
may be transformed so as to become the origin by means of a homogeneous 
Lorentz transformation. Thus the homogeneous Lorentz transformations play 


a twofold role: they effect both a transfer of the origin of coordinates and a 
passage to a moving reference frame. 


t By the Galaxy (with a capital ‘“‘g”) one means that stellar system to which the Sun 
belongs. Other similar stellar systems are termed galaxies (with a small “ g”). 
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We write ds? in the form 


- ds? = H?(¢, de?) (94.03) 
We then see that 
1 
Juv = He, 85 9 = Fey Bw (94.04) 
For the Christoffel symbols we get the expressions 
H'(S) 

Pep = gq Ca Becta + 9 Spats — &e Bug) (94.05) 

We use the equation 

ale, al 

Ria = Ga, Fae + Piel Ge — Teele (94.06) 


to form the mixed fourth rank curvature tensor (see (44.01)) and obtain 
_ (2 log H 1dlogH (? log n) ] 
B, a = 


| dS? 8g dS aS 
€ 
x 3 (€, S,,e%,% — €g S,a%a%, + eg 8% eh, — y 5.6%.) 


2dlog H dlog H\? 
+ E is + ( 78 |esl® Sg. — 8p, 8,,) (94.07) 


Here we put « = p, @ = v and sum over p, getting 


d@logH 5dlogH d log H\? 
w= | as? +8 ds 2( ds Je Sy 
@logH Ii1dlogH (/(dlog H\? 2%, 2 
elie eae oie Tuy 
= ds 8 ds ( dS Jew ale aaa 
We calculate the invariant 
6 [d@?logH 3dlogH dlog H\? 
fe dS? S dS ( dS Oe) 
R= 6 (= 3 *) wer 
” — pli +33 oe 
Now we can form the conservative tensor 
Gy =F, — 29,,R (94.11) 
From (94.08), (94.09) and (94.04) we obtain 
@logH 4dlogH (! log =) 
a E ds? +8 ds as) |= bu 
@logH Il1dlogH (- log “) | 2 
get ga 2 (94.12 
af ds? 8 ds as) \etoge (04-12) 
By Einstein’s equations we have 
Sry 
Ga fa, == eo Dis (94.13) 


As we have indicated in the derivation of these equations, the mass tensor 7’, 
in the cosmological case, when conditions at infinity are absent, is determined 


only up to an additive term of the form Ag,,; in this case the constant character- 
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izes not only the mass tensor itself but also the properties of space as a whole, 
and choosing a value for it represents a special hypothesis. 

Our basic aim is the examination of Friedmann-Lobachevsky space, but 
before we pass on to this task we shall touch briefly on the case of a maximally 
uniform space, to which we referred repeatedly in our previous discussions. 
In maximally uniform space the relations (49.12) must be satisfled. They 
may be written in the form 


Ri, ae = K(829ua — 859 ua) (94.14) 


vs 


Comparing this expression with (94.07) we obtain agreement if we put 


dlogH 1dlgH /dlog H\? 

_l = =, 94.15 

is? 8 dS ( ds ve 
2d log H dlog H\? _ : 

= (“3 ~ KH (94.16) 


Taking 1/H as our new unknown function we obtain from (94.15) a linear 
equation, the solution of which has the form 


] 


an bS? (94.17) 
Insertion of this expression into (94.16) gives a relation between the constants 
4ab = K (94.18) 
Putting a = 1 we get b = 4K so that 
1 1 
fe Soe ese T 
A i (94.19) 


and therefore 


2 2 2 2 
dag — dx — daz — daz 


dS? = fl ..1k.(,2 2. 2 2102 
[1 — K+ (xg — af — 28 — 3) ]* 


(94.20) 

In addition to the group of homogeneous Lorentz transformations, which con- 

tains 6 parameters, the expression (94.20) also allows a more general group of 

transformations with 10 paraméters; we shall not state these here (see ref. [14]). 
It follows from equation (94.14) that 


R,, = 3Kg,, 3 R= 12K (94.21) 
whence 
Guy = —3Kg,, (94.22) 
Thus a maximally uniform space corresponds formally to a mass tensor given by 
3K 
i= Fay (94.23) 


Coming back now to the Friedmann-Lobachevsky space we assume that the 
mass tensor has the form 


T,, = p*u,u, (94.24) 
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eee : ; : 
where o Is the invariant density and the uw, are the covariant components ot 
the four-dimensional velocity normalized by the relation 


uu = 1 (94.25) 
In the usual (non-cosmological) statement of the problem the mass tensor (94.24) 
corresponds to “ dust-like’”’ matter at zero pressure (see equation (32.17). 
In the cosmological statement the choice of the additive constant in the pressure 
(as of the constant A in equation (94.23)) represents a special hypothesis, as was 
mentioned, which characterizes the properties of the space as a whole. Our 
choice of mass tensor, though not the only one possible, allows us to construct 
a theory which reflects correctly some features of observed phenomena. 
If the conservative tensor (94.12) and the mass tensor (94.24) are to be 
proportional, the first term in (94.12) must be zero. This gives 


gt ee te at Clog eu d log H\? 4.26 
“a5: "3 as +( ds = ree 
If we put 
H = H?; log H = 2 log H, (94.27) 
equation (94.16) becomes linear 
a°?H, +4 dH, 
738 Bas 0 (94.28) 


‘and may readily be integrated. If we assume that at S = co we have H, — 1, 
which means that space-time is Galilean and the scale factor is the usual one, 
we get 


A 
H,=1—% (94.29) 
and therefore 
2AM. AP 
=}—-—4+— 94.30 
H=) 7 te ( ) 
With this value of H we have 
124 
¢.,=- BAL — ApS ete (94.31) 
and Hinstein’s equations become 
124 Sry , 
We form the invariants on both sides of the equation, getting 
ull of = oe (94.33) 
c S3H 


These invariants are equal to R; therefore we could have used equation (94.10) 
directly with the value of H from (94.30) inserted for calculating the right- 
hand side of (94.33). Since the density o* is positive the constant A must also 
be positive. 

With the use of (94.33) equation (94.32) gives 


u, =H ee (94.34) 
ry 
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and, on going over to contravariant components we get 


he 


= (94.35) 
HS 
We have obtained a solution of Einstein’s equations corresponding to a non- 
vanishing mass density p* uniformly distributed over all space. (In Newtonian 
theory such a solution does not exist.) This result was first obtained by A. A. 
Friedmann in the 1922 paper mentioned above [24]. 
We write the solution obtained in explicit form. We have 


4 
ds? = (1 -_ 5) (da? — da? — dx? — dz?) (94.36) 


where 
8 = Vie — at — at — a (94.02) 
It is interesting to note that this expression is invariant not only under the 
group of homogeneous Lorentz transformations, but also under inversions 
, A? 
My 


wo Bae 


(94.37) 


The latter, however, have no direct physical significance because they transform 
the region S > A into the region S’ < A, and only the case S > A has physical 
significance. It is also easy to find the equations relating the coordinates x, 
to harmonic coordinates 2%, namely 
«(1 4A A? 
pe (l455+ Gs 

The harmonic coordinates do not change under the inversion (94.37). In the 
problem of isolated masses the preferred role of the harmonic coordinates was 
based on the boundary conditions discussed in Section 93, including the con- 
dition of Euclidean behaviour at infinity. In Friedmann-Lobachevsky space, 
however, the conditions are quite different and the “ conformally-Galilean ” 
coordinates, in which ds? is of the form (94.36), are more appropriate to the 
character of the problem. 

We discuss the motion of matter that corresponds to the solution obtained. 
In the equation of continuity 


AV (— 9): p* ut] = 


(94.38) 


a, 0 (94.39) 
we must put 
3Ac? x 
V(— 9) p*ut = On ai (94.40) 


whence it is evident that it is satisfied by virtue of the identity 


x, (3) =° 
2a, xi} = (94.41) 
The velocity field is characterized by the quantities (94.35). We have 
dx, ub, 


dz, u® Lo (94.42) 
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For directness of interpretation we put x) = ct and can then write 


at (94.43) 

dt tt 
whence for each mass 

x4 = vit (94.44) 


where vj 1s constant. Since S? > 0 we must have 
ood, aoe ae (94.45) 
In the auxiliary Galilean space each mass moves with a constant velocity 
proportional to its coordinates and therefore in this space all distances increase 
proportionately with time. This is, properly speaking, only true for the auxiliary 
Galilean space, but qualitatively the conclusion that relative distances 
increase with time remains valid in the physical space also. Such an expanding 
space is thus the background which here replaces Euclidean space. The 


observable physical consequences of this at first sight paradoxical result will 
be examined in the following section. 


The quantities », introduced by equation (94.44) may be considered as the 
coordinates of a given mass measured with a scale increasing with “time” 
t. With this scale the coordinates of each mass point are constant. Therefore, 
from the point of view of the equations of motion the v; are a type of Lagrangian 
coordinates (as distinct from the Eulerian coordinates z;.) We introduce into 
ds® as spatial variables the three quantities v; and as a time variable the 
quantity + which is proportional to S and given by 


1 
t= ¥/ |e = oat + 2+ “| (94.46) 
Our transformation is then 


UT Ay 


= a: = a (94.47 
"= Tama) 
Evidently relation (94.36) is identically satisfied. We have 
8 la a dy. O48 
eT] ea 
and 
1 7 v4 dv; 
fs a shh Oe 94.49 
Om Tay Fae ee ai 
whence 
c? dt? — (da? + da? + dx?) = ce? dx? — v° do?® (94.50) 
where ; 


(dv)? —[v x dv]?/c? 
(1 — v?/c?)? 
This is the square of the element of arc length already known to us from the 
Lobachevsky-Einstein velocity space of Section 17. In place of (94.51) we can 

also write ; 


do? = (94.51) 
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1] VIVE 
do? = al + eo es dur (94.52 

If instead of A we introduce a constant « such that 
S07: A=ca (94.53° 

we obtain for ds? 
4 
ds? = (2 = “| (c2 dt? — +? do?) (94.54) 
T 


The quantity + may be called the proper time in the auxiliary Galilean space. 
The physical proper time on the other hand is the quantity 7 determined by 
the equation 


4 2 2 
r={ (1-5) dt = 1 —~ — 2a log * (94.55) 
“G T. ie 


Here the convention is to count time from that very distant epoch (some 
thousands of millions of years ago) when the galaxies observed to-day were 
much closer together than they are now. There is no sense in applying the 
above equation to yet earlier epochs because it ignores the direct gravitational 
interaction of the galaxies which could have been considerable at that time. 

The spatial part of ds? ,which corresponds to a fixed value of the universal 
time 7 (and therefore also to fixed t) is equal to 


a 4 
dl? = (1 = *) do? (94.56) 
Tv. 


This describes a Lobachevsky space of constant negative curvature. Its 
volume is infinite. 

We consider the mass of those galaxies whose velocities in the auxiliary 
Galilean space are within given limits. If p* is the invariant density this mass is 


dM = p* dV (94.57) 


where dV is the volume element in the Lobachevsky space with the metric 
(94.56). As is easy to calculate from (94.52) the volume element in the velocity 
space is 


dv, dv, dv 
dV, = —-~ + 3 
Therefore the volume element in the Lobachevsky space is 
a\§ | 
dV = “(1 _ *) dV, (94.59) 
T 
and the mass dM is a\ 6s 
dM — e*ea(1 _ *) dV, (94.60) 
T. 


Phe equation (94.33) for the invariant density may be written in the form 


Po 3c2 A 


° Sey SL ASS 


(94.61) 
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whence, using (94.53) we get 


a\§ 3 
o*ce(1 _ *) ae (94.62) 
= ary 
Inserting this expression into (94.60) we get 
9 
dM =~ ay, (94.63) 
airy 


which can also be written as 


3x dv, dv, dv, 


(dM = 
Qn (1 — ve? 


(94.64) 
In the study of the red shift in the spectra of distant galaxies it is customary 
to characterize the shift by the speed v* which would produce it if it could be 
ascribed entirely to Doppler effect in ordinary Galilean space. From the theory 
of the red shift which will be developed in the following section it follows that, 
as long as it is small compared to the speed of light, this effective speed is related 
to the speed discussed here by the equation : 


T+a 


To Oh 


al 


v (94.65) 


With the aid of this relation the previous equations may be more easily inter- 
preted. If we put : 9 
ce ap = “| (94.66) 

TH+o% 


equation (94.64) can be written approximately as 
oo* 
all = ae det deZ dey (94.67) 


Integrating over solid angle and over the absolute magnitude of the effective 


specd, we obtain the relation 
y(AL — My) = 2x*(u*4 — v%) (94.68) 


Hence we can get an estimate of «* from an estimate of the mass Mf — M, 
of those galaxies which have effective Doppler speeds in the range between 


vy and v*. 


95. Theory of the Red Shift - 

The hypothesis that the structure of very large regions of space containing 
many galaxies is of the nature of the structure of Fricdmann-Lobachevsky 
space found unexpccted confirmation in the phenomenon of the red shift in 
the spectra of galaxies discovered by the astronomer Hubble [25]. It was 
shown that all the lines of the galactic spectra are shifted towards the red 
end of the spectrum and that this shift is the greater the more distant the 


alaxy. 
: In this section we shall give a theory of the red shift based on the hypothesis 


mentioned. 
We first write the expression for ds? in the form 
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4 
ds? — (1 = *) (c? dt? — dx? — dy? — dz?) (95.01) 


T. 


Here « is a positive constant and 7 is given by 
2 
r=,/(0 -") a ell Be ad (95.02) 
c 


Expression (95.01) actually contains not one constant but two because the 
origin of time reckoning or, equivalently, the value of + for-the present epoch 
remains unknown. In principle, the theory of the red shift makes it possible 
to determine these two constants by observation. Also, the theory permits of 
verification if the mean density p* of matter in space is known. 

The change of the frequency of light arriving from a distant star arises as 
a result of two effects: the Hinstein effect (Section 51) which is operative 
when light passes from a region with one value of the gravitational potential 
Joo to a region with another value, and the Doppler effect (Section 13) which is 
operative when the radiating system is in motion. 

The fact that Friedmann-Lobachevsky space is conformally-Galilean con- 
siderably simplifies the discussion of the law of light propagation. For in a 
space having the squared line element (95.01) the law for the propagation of 
a wave front has the form 


1 [é@w\? @w\* Ow \? Ow)? 

Se: a hees ee —}|=0 95.03 

c (3) (3) 7 (F) i (Z) | 
which is exactly the same as the corresponding law in Galilean space-time. 


The equations of the characteristics will therefore be the same as in the Galilean 
case ; In particular the surfaces 


t —" — const ; t+ " = const (95.04) 
c c 


are characteristics. The entire discussion related to the law of wave front 
propagation and the form of the characteristics can be taken over without 
change from the Galilean case to that of conformally-Galilean space-time. 
In particular the whole theory of the Doppler effect remains unchanged. 

After these preliminary remarks we go on to derive the equations for the 
change of the frequency of light. 

We shall use a coordinate system with its origin in the Sun or on the Earth 
(it is immaterial which we choose because of the smallness of the relative dis- 
tance of Sun and Earth compared with the distance from other galaxies). 
Assume that at ¢ = ty light is emitted by a distant star which then is located 
at the distance r = ry. The value of To Corresponding to time ¢, and distance 


71s ; 
a 
to= /(@— ‘a) (95.05) 
according to (95.02). By the equation of the characteristic, 


To 


a ee a ; (95.06) 
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this light reaches the Earth (r = 0) at the instant ¢ = ¢, + r,/e which corre- 
sponds to a value of + given by 


Y9 
t= ty = (95.07) 

From this equation and (95.05) we get 

To fo — a) 

t 7 (: + (role) ae 
On the other hand, if the emitting star moves with the velocity corresponding 
to the mean velocity of matter in Friedmann-Lobachevsky space, then, 
according to (94.34) 


;  (v>0) (95.09) 


where v is the constant speed of the star in the auxiliary Galilean space. From 


(95.08) and (95.09) we obtain 
fo J (: = ‘) (95.10) 
T c+v 


We now consider the frequency change. Let «» be the proper frequency 
of the emitter and w> the frequency in the auxiliary Galilean space in the 
frame of reference of the emitter. The two quantities are connected by the 


relation 
2 
a a *) (95.11) 


To 


The conversion factor is the value derived from (95.01) of the quantity 
V(Joo)/c at the time and place of emission. In the auxiliary Galilean space an 
emitter of frequency w} is moving with speed v away from the Earth. The 
frequency «* received on the Earth is then obtained by the usual theory of the 


Doppler effect and is 
c—v “ 
ot = at ei ( = *) (95.12) 


According to (95.10) this relation may be written in the form 
to* = Toto" (95.13) 


; now remains to pass from the frequency w* in the auxiliary Galilean space 
) the frequency « in physical space. The transition is effected by an equation 
1alogous to (95.11); but the value of the quantity +/(go9)/e has to be taken 
m the place and instant of observation. The relevant equation is 

1 1 a)? 

eo =(! — *) (95.14) 


a) @ 


rom (95.11), (95.13) and (95.14) we get finally 
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2 2 
-(1 = *) _ at - =) i (95.15) 
Tv To 


Here, w, is the proper frequency of the emitter, e.g. of a hydrogen atom, and 
« the frequency of the light reaching the Earth from an identical emitter on a 
distant star. Since + > t, and the factor of w in (95.15) is an increasing function 
of t, we have ‘w <w,; thus the frequency decreases and the displacement of a 
visible spectral line is therefore towards the red end of the spectrum. 

In our proof the inclusion of the change of +/ oo in passing from the star to 
the Earth corresponds to taking the Einstein effect into account whereas the 
transformation according to equation (95.12) takes account of the Doppler 
effect. 

Let us now investigate the change in the amplitude of a light wave travelling 
from the star to the Earth. The distance of a star can be judged from the energy 
current density which is proportional to the square of the amplitude, if stars 
of the same radiative power are compared. (Novae at their period of maximum 
brightness may be assumed to have such equal power.) 


In a.space which has (94.01) for its squared: line element the wave equation 
has the form 


a ay 
| Fee : 
mal 6, me) 0 (95.16) 
and if we put 
= ai (95.17) 
7 
where )* is a new function, the wave equation may be written as 
HOy* — p*OH = 0 (95.18) 
Here U1 is the Galilean d’Alembert operator, so that 
aye 
Oy* =e “b (95.19) 


u Cx 
Ti 
The function H varies incomparably more slowly than )*, therefore the second 
term in (95.18) may be discarded and the wave equation may be written as 


Cy* = 0 (95.20) 
(this approximation has no effect at all on the expression for the energy current 
in the wave zone, which alone interests us). 

We now take the origin of coordinates to be on the star and consider a 
spherical wave radiating from there. The quantity {* will then have the form 


yr : f ( 7 ‘) (95.21) 
and therefore 2 = ( a 4) =a pl (. = ‘) (95.22) 


The square of the amplitude of v will be a i i 
quantity proportional to th 
current density J. Therefore we may put sas ae re 


= B 
~ A apy oo) 
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where B is a positive constant equal to the energy flux into unit solid angle. 


Indeed, it follows from the expression for ds? which appears in spherical polar 
coordinates as 


4 
ds? — (1 er *) {c? dt? — dr? — r(d9? + sin? 3 dp2)} (95.24) 


that the surface element on a sphere about the source ist 


4 
dS = ra(1 = *) sin 9 d9 dp (95.25) 
ic 
Hence the flux of energy through an element of the surface S is 
1 dS = Bsin 9.d9 do (95.26) 


which shows that B is the flux through unit solid angle. 

The quantity B is constant in the frame of reference in which the wave 
equation and its solution (95.22) have been stated. But it must be remembered 
that B is a dimensioned quantity whose value depends on the choice of units 
and of the standards used for its measurement ; in the general case the passage 
to a new frame of reference has an effect on the standards. The dimensions 
of an energy flux are energy divided by time, or in other words, action times 
frequency squared. Action is an invariant and the scale of action does not 
change in the passage from one frame of reference to another. Therefore the 
energy current changes in such a transition in the same way as the square of a 
frequency. If the frequency of a quantum emitted on the star and reaching 
some given region of space is taken as the standard of frequency, the energy 
current will be numerically constant. But if one takes in every region of space 
a “local” standard of frequency (for instance the frequency of a quantum 
corresponding to the same spectral transition but emitted in the region in 
question and not on the star) the numerical value of the energy flux will 
change in the ratio w?:w?. Naturally measurements made on the Earth 
employ “terrestrial” and not “ stellar” standards, and this corresponds to 
the second point of view. Therefore, expressing the energy flux and its density 
in “‘ terrestrial” units we must put} 


“se 
B= By (95.27) 
0 
Byw? 
pee ee 95.28 
and i w2r2(1 — at)! ( ) 


where B, is the value of the energy flux into unit solid angle near the star ; 
for stars with equal emissive power (i.e. with equal absolute stellar magnitude) 


the value of B, is the same. Putting 
pea= (95.29) 
R2 
where R is the “distance” determined by the apparent stellar magnitude 


iy 4. 2 . 
t h esult agrees with the conclusion of Landau “nd Lifshitz [23 l which they reach d 
T is T e 


from other considerations. 
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(brightness) we have 
2 
R= 20 (1 = *) (95.30 
Tt. 
In this equation r is the distance, in the auxiliary Galilean space, from th« 


star to the Earth in the frame of reference attached to the star. We express 
rin terms of t and ty. In the frame of reference mentioned we have 


for t=); r=O0O and t= 17; 

r 2r (95.31) 
for t=) +73 r=r and t= (ra(70 + =}} 
eee 
whence -= (95.32) 

C 2% 
Inserting this value of r into (95.30) we get 
2. 72 2 

On. “(1 = *) (95.33) 

c @ 2% 
Also, equation (95.15) gives 

a 2 
i eee eee (95.34) 


@ tT) (1 — a/t,)? 


The values of w,/w and of R for different galaxies are connected by the relation 
which can be obtained from these two equations by eliminating ty. To perform 
this elimination conveniently we introduce another auxihary quantityt y in 
place of ty to which it is related by 


(+ — To)(t + a) 


ree (97.35) 
and = we (95.36) 
For various galaxies t) may vary within the limits 
CL tp < ot (95.37) 
so that the limits of y will be 
0O<y<n (95.38) 
Inserting the value of ty into (95.34) we get 
@o (Fa + yt +a + ay) (95.39) 
o (t+ + a)? 
or —=1+y+ ear. y (95.40) 


Also, inserting +, from (95.36) and w,/a from (95.39) into the equation (95.33) 


+ There should be no danger of confusion between this quantity and one of the 
coordinates in (95.01) ‘and (95.02). 
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for Ri/c we get 


Ro (t — a)? + hy?) 
a ee (95.41) 
We put aw Mtt% 
e pu ira (95.42) 


re Bye < 
Then we have ee pape (95.44) 
5 ne (eae 


The quantity 4 has the dimensions of inverse time and the quantity (95.44) 
is a dimensionless number. Replacing + and « in (95.41) and (95.39) by these 
quantities we get 


(95.45) 


Dg — @ 


pr 
=yt 37 aay" 


These equations embrace the whole theory of the red shift. 

The general trend of the dependence of the shift (wy — «)/w on distance is 
the following. 

For small shifts (small y) we can take the right-hand sides of the two equations 


to be equal. Then we have 
Wo —- 


(ad Sees (95.46) 


G@) 


and the relation between the red shift (w, — w)/w and the distance RF is linear. 
[he coefficient of proportionality in this linear relation may be determined 
‘rom observation and this gives a value [25] for the reciprocal of h, of the 


order 
= 4 x 10° years (95.47) 


ole 


The earlier literature gave a value of 210° years for 1/h ; the difference is due 
© an inaccuracy in the earlier estimates of the distance R.) The value (95.47) 
,orresponds to the following value for h 


h=—8 x 10-38 sec (95.48) 


For large shifts, ie. very large distances, the nonlinear terms in equations 
95.45) must have an effect. Their influence must show itself in the fact that 
he shift increases with distance more slowly than initially ; the quantity h 
letermined from (95.46) by observations on the furthest galaxies must prove 
o be somewhat smaller than if it is determined from observations on nearer 
alaxies. At the greatest distances, which probably cannot be observed in 
ractice, the relation must again tend towards the lnear, but with a smaller 
actor of proportionality. 
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In astronomy it is customary to call the left-hand side of (95.46) a speed 
and to express it in kilometres per second. However, it 1s necessary to remember 
that although this quantity has the dimensions of a speed it is not the speed 
of motion of anything. This is evident, for instance, from the fact that if 
w < 4u, its magnitude exceeds the speed of light. Also the quantity c(w» — w)/w 
is not a Doppler speed even if the whole shift is interpreted as the result of a 
Doppler effect in Galilean space. It follows from the ordinary theory of the 
Doppler effect (see (95.12)) that the effective Doppler speed would be the 

uantity ; 

Yer = gone (95.49) 
Ws +0 


which is close to the quantity on the left-hand side of (95.46) only if the shift 
is small. 

The equations (95.45) are rigorous and give the dependence of the shift on 
distance for any value of the former. It is possible that in time the accuracy 
of observations will increase so much that their analysis on the basis of these 
equations will make it possible to study the coefficient of y? in the second 
equation of (95.45). This will then make it possible to obtain from observations 
on the red shift not only the constant A (known as Hubble’s constant) but also 
the mean density p*. At present, however, p* can only be determined quite 
roughly by counting the number of galaxies in sufficiently large volumes of 
space on the assumption that each galaxy has the same mass as our Galaxy 
(of the order of 101! solar masses or 2 x 10“*g). Such an estimate gives a density 


f the ord 
of the order o* = 4 x 10-* g/cm? (95.50) 


A more vivid idea of the magnitude of this density can be given as follows : 
The volume of the terrestrial globe is 108332 x 10?’cm? or, roughly, 10?’cm?. 

In space a volume of this size contains in the mean 0-04g of matter.t+ 
If p* and A are assumed to be known the value of the dimensionless constant 
» — ote" (95.51) 

3h? 

may also be determined. This constant appears both in the equation (95.45), 
which may be written as 


Rh i ee 
—=y+sy 
e A (95.52) 
Gg — ; b 3 
— yr 
Boat. 3 es hs icles 
and in the relation (95.44), which gives 
dry 
bea (95.53) 
(7 +)? 
If we take as values 
kh = 8 x 107!8sec-1 
oP each Xx 10>? ej/em§ (95.54) 


») 
~ 
Pi 3 

t We stress once again that the quantity p* is known with very small accuracy ; its true 
value may be many tires grester or smaller than the value given. 


x 10-7 cm?/(g. sec?) 
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we get 
ae 1 
=355 (95.55) 
It is then possible to use (95.53) to determine the ratio t/a, namely 
ai 
—-&10 
~ (95.56) 
Hence and from (95.42) we have 
co tt 150; hee OD 95.57 
== = fou; a= U . 
oa? (95.57) 
and, using the value of 1/h given by (95.47) we get 
t= 6 X 10° years 
a= 6 x 108 years 2508) 
and from (94.45) 
T = 3-2 x 10° years (95.59) 


The following should be borne in mind. The ratio t/a given by equation (95.53) 
is real only if 6 is less than unity, i.e. if for given h the density o* is sufficiently 
small, as in fact appears to be the case. There exist also solutions of Hinstein’s 
equations corresponding to 6 >1, but for these the conformally-Galilean 
coordinates we have used are inapplicable because they become complex. 
It is true that one can introduce other coordinates that are real, but the expres- 
sion that one gets for ds? when b > 1 leads to such strange notions concerning 
the properties of space and time (a finite volume varying periodically with 
time, etc.) that one can hardly ascribe physical meaning to this case. There- 
fore we shall not discuss the solution for the case 6 > 1. (This solution was 
also found by Friedmann.) 

As for the solution we have discussed, which corresponds to 6 <1 (the 
Friedmann-Lobachevsky space) the following remarks should be made con- 
cerning it. 

In the first place it is incorrect to see in it some kind of a model of “ the 
Universe as a whole”: such a point of view is unsatisfactory in a philosophical 
respect. At most Friedmann-Lobachevsky space can serve as the background 
for a limited number of galaxies just as Galilean space serves as a background 
for objects such as the Solar system. 

The actual applicability of Hinstein’s equations in their classical form to 
such enormous dimensions is not so well established as in the case of phenomena 
on a more limited scale. It is not out of the question that on a cosmic scale 
these equations will require modification and generalization. 

Among the consequences of the solution that we have discussed the most 
paradoxical is probably the relative shortness of the time J which has elapsed 
since the galaxies were close together. This time is comparable to the age of the 
Farth’s crust as determined by the radioactive decay of uranium, while it 
would have been natural to expect the “cosmic”’ scale of time to be much 


greater than the “ terrestrial ” 
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The fact of the ‘ dispersal” of the galaxies, in all its unexpectedness, is so 
well established, however, that one need not have any doubts about it. The 
number of binary stars which is abnormally large from the point of view of 
stellar statistics, also indicates that in the past the density not only of galaxies 
but also of stars was probably much greater. oe 

In judging the theory it is also necessary to bear in mind that it is a successful 
attempt to resolve the Seeliger paradox referred to in Section 94 which arises 
when one studies a uniform mass density in Euclidean space. 

It is therefore beyond doubt that the theory due to A. A. Friedmann which 
we have described is an important step in the study of space on a cosmic scale. 


96. The Development of the Theory of Gravitation and of the Motion of 
Masses (A Critical Survey) 

Hinstein’s basic paper that laid the foundations of the theory of Galilean 
space and time made its appearance in 1905 under the heading ‘‘ On the Electro- 
dynamics of Moving Bodies”. This paper linked together for the first time the 
principle of the constancy of the speed of light and the principle of relativity, 
according to which corresponding physical processes exist, and proceed identi- 
cally in any two inertial frames. To make these two principles compatible it 
proved necessary to re-examine the notions of space and of time and this led 
to a theory of a unified space-time. This theory was given the name of “ Theory 
of Relativity ”. It is worth noticing that this name did not yet appear in the 
title of Einstein’s paper. 

We know now that the principle of relativity reflects the uniformity of space- 
time : either the total uniformity of Galilean space, or—in a generalized theory-—- 
uniformity at infinity, which under certain conditions allows one to introduce 
harmonic coordinates. In both cases the principle of relativity is expressed by 
the existence of the group of Lorentz transformations within a definite class 
of coordinate systems. 

During the period when Einstein created his theory of relativity (before his 
theory of gravitation) the idea that it is possible for physical processes to 
influence the space-time metric had not yet arisen; in other words, it was then 
tacitly assumed that the metric was rigid. But if this assumption is made, the 
principle of relativity expressed in its most general form (in arbitrary coordi- 
nates) leads to uniform space, and if one makes the additional assumption that 
there is no absolute scale, it leads just to Galilean space (Section 49*), 

The basic idea of the theory of relativity consists precisely in establishing 
the metri:, of space-time and in pronouncing the requirement that the form of 
the laws of physics should correspond to this metric; since the metric expresses 


the uniformity of space-time the form of the laws of physics must take into 
account this uniformity. This requirement means that the equations expressing 
physical laws must be covariant with respect to Lorentz transformations. The 
requirement mentioned has a very great heuristic significance because it severely 
limits the possible form of physical laws. It is, therefore, not surprising that 
the applications of the Theory of Relativity affect all branches of physics and 
are too numerous to list. 

In the period between 1905 and 1915 Einstein created his Theory of Gravita- 
tion. In 1916 he published his fundamental paper under the heading “The 
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Principles of the General Theory of Relativity ’. This title reflects Einstein’s 
point of view regarding his theory—a point of view which we cannot accept as 
correct. As we have already shown in the Introduction, the basic idea 
of Einstein’s theory of gravitation is by no means that ‘the principle of 
relativity 1s generalized for accelerated motion—such a generalization is 
not possible. The basic idea consists in assuming that physical processes can 
influence the space-time metric and in recognizing the unity of metric and gravi- 
tation. These ideas could be formulated with help of the formalism of 
Riemannian geometry. As to the relativity principle, one may say that it does 
nes play any essential part in the theory of gravitation. The concept of 

relativity of motion ” is restricted rather than widened in this theory. This 
1s quite natural since the inclusion of the gravitational field in the metric field 
destroys the uniformity of space-time, which is only locally restored by the 
application of the equivalence principle. The local analogue of an inertial frame 
of reference, the “ freely falling ” frame, is only an approximate realization of 
the former. On the other hand, the harmonic coordinates, though not local, are 
only partly analogous to inertial frames. _ 

The fact that the theory of gravitation, a theory of such amazing depth, 
beauty and cogency, was not correctly understood by its author, should not 
surprise us. We should also not be surprised at the gaps in logic, and even 
errors, which Einstein permitted himself when he derived the basic equations 
of the theory. In the history of physics we have many examples in which the 
underlying significance of a fundamentally new physical theory was realized 
not by its author but by somebody else and in which the derivation of the 
basic equations proposed by the author proved to be logically inconsistent. 
It is sufficient to point to Maxwell’s theory of the electromagnetic field. This 
theory actually put an end to the conception of mechanics as the basis of 
physics, but its author and also Hertz, who did so much in verification of the 
theory, wholly accepted a mechanical picture. Only Lorentz established with 
full clarity the physical sense of Maxwell’s equations, showing that the electro- 
magnetic field itself is a physical reality (i.e. is itself material), being capable 
of existence in free space and not requiring a special carrier. t 

As for the cogency of the derivation, we can recall that Maxwell [27] prefaces 
the derivation of his famous equations by the exposition of some chapters in 
mechanics and bases the derivation itself on Lagrange’s equations (of the second 
kind). We know now that from the point of view of logic, a denvation of 
Maxwell’s equations on the ‘basis of mechanics is not possible. But great, 
and not only great, discoveries are not made by the rules of logic, but by guess- 
work, or in other words by creative intuition. 

It is interesting to trace the path along which Einstein reached his equations 
of gravitation. This can be done by using Einstein’s fundamental paper of 
1916 together with his other scientific work and his autobiography published 
in 1949 on the occasion of his seventieth birthday [26]. 

As early as 1908 it became clear to Hinstein that any attempt to introduce 
gravitation within the framework of the theory of relativity in Galilean space 
could not be successful for the following reason. Such a theory predicts a 

oe * his example of an author not fully understanding the physical 
Palas pecagetes ae ra enike Einstein himself. (Autobiography [26].) 
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dependence of the inertial mass on the internal (and kinetic) energy oi a body, 
but cannot give a dependence of gravitational mass on energy as long as the 
gravitational equations are taken to be linear. At the same time one has the 
fundamental fact that inertial and gravitational masses are equal, as is proved 
by the fact that the acceleration of a freely falling body is independent of its 
composition. Einstein interpreted this fact in the sense of the local equivalence 
of fields of acceleration and of gravitation. 

Hence Einstein concluded that if ‘accelerated frames of reference” are 
accepted gravitation and acceleration become indistinguishable. In his Auto- 
biography Einstein writes: “If therefore gravitational fields of arbitrary 
extension, not restricted a priori by spatial boundary conditions, are regarded 
as possible the concept of an inertial frame becomes completely empty. The 
notion of ‘ acceleration relative to space’ then loses all meaning ; so does the 
Principle of Inertiat and with it Mach’s paradox.” ft 

Further, Einstein concludes from the equivalence of all reference frames 
(inertial and non-inertial) that the gravitational equations must be covariant 
with respect to arbitrary coordinate transformations. 

Such were the ideas that represented Einstein’s starting point in 1908. 
Let us analyse them. In the first place there is here an incorrect initial assump- 
tion. Einstein speaks of arbitrary gravitational fields extending as far as one 
pleases and not limited by boundary conditions. Such fields cannot exist. 
Boundary conditions or similar conditions which characterize space as a whole 
are absolutely essential and thus the notion of “ acceleration relative to space ” 
retains its significance in some form or another. As for Mach’s paradox, it is 
based, as is well known, on the consideration of a rotating body which has 
ellipsoidal form and a non-rotating body which is spherical. A paradox arises 
here only if one denies a meaning to the notion of “ rotation relative to space ” ; 
both bodies, that which rotates and that which does not, then appear to be 
equivalent and it cannot be understood why one of them is spherical and not the 
other. But the paradox vanishes as soon as one accepts the legitimacy of the 
notion of “ acceleration relative to space ”’. 

The essence of the error committed in the initial assumption consists in 
forgetting that the nature of the equivalence of fields of acceleration and of gravitation 
is strictly local. Related to this is the fact that a non-local physical definition 
of an accelerated frame of reference is impossible, because a box, rigid scaffolding, 
etc., of the kind used by Einstein represents an idealization which is only 
applicable in inertial frames of reference, and not in accelerated frames. But 
physically the “general principle of relativity ”, in the sense that correspondmg 
processes exist in arbitrary reference frames, does not hold at all. Therefore 
Einstein’s conclusion that all reference frames are physically equivalent, is 
without foundation. But this is the assumption on which Einstein bases his 
further arguments, in particular his derivation of the covariance of the required 
equations of gravitation. Naturally it does not follow from the incorrectness of 
his assumptions that the conclusions drawn are incorrect: the equations of 
gravitation are indeed covariant, but their covariance is not their distinctive 


t Newton’s First Law (V.A.F.). 
} The above quotation is a direct translation from the German text by N.K. 
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property; it could be achieved with any tneory. Thus in Chapter IV of this 
book we formulated the ordinary theory of relativity in a covariant manner ; 
it is interesting that Einstein himself points out the possibility of such formu- 
lation [26], a fact to which we shall return shortly. 

Thus at the first stage of constructing a theory of gravitation Einstein 
accepted as an initial assumption the equality of inertial and gravitational mass, 
the principle of the equivalence of acccleration and gravity and the require- 
ment of covariance for the equations. The idea of Riemannian geometry and 
the idea of the unity of metric and gravitation were as yet absent. 


The requirement that the equations be covariant played a very large part 
in the construction of the theory. The circumstance that Einstein “ derived ” 
it from the principle of equivalence and ultimately from the equality of inertial 
and gravitational mass shows that at the time he was creating his theory 
Einstein considered the general covariance of the equations to be a specific 
peculiarity of the theory of gravitation. This is said explicitly in the intro- 
ductory part of the fundamental paper of 1916. Subsequently it was pointed 
out to Einstein that general covariance by itself does not express any physical 
law and, apparently, he agreed with this. But his agreement was rather formal, 
because in fact to the end of his days Einstein connected the requirement of 
general covariance with the idea of some kind of “ general relativity ” and with 
the equivalence of all frames of reference. He never realized the difference be- 
tween physical equivalence (or physical relativity) in the sense of the existence 
of corresponding processes in different frames of reference and that formal 
equivalence which consists in the possibility of using. arbitrary coordinate 
systems independently of whether corresponding physical processes do or do 
not exist in the systems concerned. In the theory of Galilean space-time 
physical relativity is directly connected with the uniformity of the space. In 
the theory of gravitation, where space-time Is non-uniform and its metric de- 
pends on the processes occurring in it, things are somewhat more complicated. 
But here also the existence of a kind of relativity within a class of harmonic 
coordinate systems is a result of the uniformity of the space at infinity (see 
Sections 49* and 93). Thus in all cases where there is physical relativity, it is 
connected with uniformity, i.e. with objective properties of space-time and not 
at all with the use of this or that coordinate system. (The choice of coordinate 
system has an effect only on the simplicity of the way physical relativity can 
be expressed: it is expressible most simply in such coordinates in which physically 
equivalent systems are related by linear transformations.) The mathematical 
expression for physical relativity does not reduce to tlie covariance of differential 
equations: the latter is a necessary but not a sufficient condition for the existence 
of the former. But Einstein identified physical relativity with the covariance 
of differential equations, actually believing that general covariance 1s an Cx- 
tension of the concepts of the physical relativity that formed the basis of his 
1905 theory. This is the origin of Finstein’s conviction that there exists a 
“ General Principle of Relativity ”. This conviction evidenced itself in the fact 
that he called his theory of gravitation ‘ The General Theory of Relativity: 
and that in later years he stubbornly adhered to this term. ute then nanied his 
which opcrated in Galilean space, the “ Special ” Theory of 
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concepts of physical relativity and formal covariance is particularly clearly 
shown at one point in Hinstein’s autobiography where he himself formulates 
the “ special ” theory of relativity (i.e. a theory in which, according to Einstein 
there is no “ general relativity ”’) in the general covariant form, which, according 
to Einstein, expresses the idea of ‘ general relativity ”’. 

Thus in the period immediately preceding the creation of the theory of 
gravitation the idea of general covariance appeared to Einstein as a physical 
idea and it guided him in his researches. 

The next important step in the creation of the theory was made when 
Einstein introduced the generalized expression for the square of the interval, 
i.e. when he, in fact, introduced the hypothesis of Riemannian geometry for 
space and time. Einstein approached the idea of Riemannian geometry from 
the direction of requiring the general covariance of the equations. He made 
use of the fact that in the “‘ absolute differential calculus ” (tensor analysis) 
which by that time had been fully developed by Ricci, Levi-Civita and others, 
equations appear from the start in generally covariant form. Thus the require- 
ment of general covariance played an important heuristic role, in spite of the 
fact that the essence of the matter lay not there, but in the new hypothesis 
concerning the character of space-time geometry. 

In solving the problem of the form of the gravitational equations an important 
step was made by Einstein in assuming that purely ‘“‘ geometrical ”’ quantities 
namely the coefficients g,,, in the expression for the squared interval should be 
considered to play the part of the gravitational potentials, and that no other 
quantities need be introduced. From among the multitude of possible non- 
linear generally covariant equations this assumption served to single out almost 
uniquely one set of equations. To achieve full uniqueness some supplementary 
considerations had to be introduced, such as the requirements that the mass 
tensor be conservative and that in the absence of masses there should be a 
solution corresponding to Galilean space. 

These two ideas——the idea of generalizing the metric and the idea of the unity 
of metric and gravitation—-were decisive. Operating with them, Einstein was 
led to his equations of gravitation 


R,, —49,,R = —xT (96.01) 


which contain the essence of this theory and represent one of the greatest 
achievements of human genius. These equations are given in Hinstein’s 1916 
paper. 

Subsequently Einstein made attempts to alter these equations but the 
original form remains the only correct one, provided only the meaning of 
the mass tensor 7’, is interpreted correctly. We have in mind the circumstance 
that in the cosmological case the mass tensor is determined only apart from a 
term proportional to the metric tensor: the factor of proportionality in that 
term characterizes not only the mass tensor itsclf but also the properties of 
space as a whole (the factor plavs the part of a cosmological constant). Einstein 
also attempted to find equations not involving « mass tensor for some “ uni- 
versal field ” (Gesamtfeld) that was to include gravitation: apparently he con- 


t Here we use the word “‘ geometry” in the sense of objective properties of space and time, 
and not in the sense of the science of these properties. 
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nected these attempts with nis hopes of creating, independently of quantum 
theory, a theory of elementary particles that were to be the singularities of the 
field. These attempts ended in failure and we only mention them because the 
idea of considering singularities of the field in place of the mass tensor was 
also pursued in the work of Einstein and his school on the derivation of the 
equations of motion from the gravitational equations. 

Chronologically the first of these papers were those of Einstein and Grommer 
and of Einstein, published in 1927 [28, 29]. In this work the following question 
was posed and solved in principle. If the masses are considered as singularities 
of the field, is it possible to prescribe arbitrarily the motion of these singularities, 
without violating the gravitational equations? This question was answered in 
the negative: it emerged that the motion of the singularities could not be 
prescribed, because it is a consequence of the gravitational equations. Hence 
it follows in particular that the principle of the geodesic is not independent 
of the gravitational equations but is a consequence thereof. It is of importance 
to note that this result is conditioned by the non-linear character of the 
gravitational equations. 


In spite of the enormous importance of this result the two papers of Einstein 
just mentioned aroused little interest and more than ten years passed before 
they were followed up. 

The development of the ideas laid down in the paper by Einstein and Grommer 
in 1927 began in the years 1938-39 quite independently in two directions. 
The work of Einstein, Infeld and their collaborators is concerned with the one 
development, the work of Fock and his collaborators with the other ; the 
papers of Papapetrou, which appeared after 1950, can also be counted as 
going in the second direction. 

We examine first the work of the Einstein school. 

In a series of papers by Einstein, Infeld and Hoffman (1938) [30] and by 
Kinstein and Infeld (1940) [31], (1949) [32], a method was elaborated which made 
it possible to derive from the gravitational equations and to write in explicit 
form an approximation to the equations of motion for pointlike singularities 
of the field, which represent spherically symmetric bodies of infinitesimal size. 
The equations were obtained both in the first (Newtonian), and in the following 
approximation. The authors were guided by Einstein’s idea of the undesirability 
of introducing a mass tensor and therefore started with the gravitational 
equations with a vanishing right-hand side. (Incidentally, Infeld showed in 
1954 that the mathematical work becomes incomparably simpler if, from the 
start, one introduces a mass tensor involving Dirac delta-functions, corres- 
ponding to spherically symmetric singularities.) In attempting to find an 
exact solution of the mechanical problem of the motion of the field singularities 
the authors also introduced infinite series in powers of a parameter inversely 

i to the speed of light. 
ag errant naeg the ae of the work of Einstein’s school is made 
difficult by the fact that the coordinate system is not specified from the start 
(in the way we specified the harmonic coordinate system), but instead is deter- 
mined from approximation to approximation by means of various supple- 
mentary conditions which are sometimes introduced explicitly and sometimes 
implicitly. In general, however, one can say that the coordinates that are in 
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fact employed in this work differ so little from harmonic coordinates, that the 
differences have no effect on the equations of motion in second order. If one 
denotes the harmonic coordinates by (t, 2;) and the coordinates used by Einstein 
and Infeld by (¢’, 2’) then in orders of magnitude we have 


4 L 5 
xf =a +0 (4). =t+0 (+) (96.02) 
c 


where LZ and q are the parameters we have used repeatedly to characterize the 
orders of magnitude of length and velocity.. This question is discussed in more 
detail in the author’s paper [45]. 

In those cases when approximations higher than the second are discussed 
another question arises, in addition to that of the indeterminateness of the 
coordinate system. It becomes unclear whether the formal solution obtamed 
has physical sense. This doubt is due to the fact that the authors of these 
papers do not impose the condition of outward radiation and, in fact, make the 
contrary assertion that there exist certain (unknown) coordinate transformations 
which reduce the exact equations of motion to Newtonian form, corresponding 
to a strictly conservative non-radiating system. Both parts of the latter asser- 
tion, namely the possibility of reducing more general equations of motion to 
Newtonian form and the possibility of reducing a non-conservative (radiating) 
system to a conservative one, seem doubtful. 

The 1927 paper by Einstein and Grommer is also the starting point of a series 
of papers in the other direction. 

In our paper [34] published in 1939 we studied the problem of the motion of 
spherically symmetric non-rotating bodies of finite size and derived equations 
of motion for them, starting from the gravitational equations. (See also [36].) 
The particular characteristics of the method used by us are on the one hand 
the study of the mass tensor, which is determined in parallel with the gravita- 
tional potentials, and on the other hand the use of harmonic coordinates : our 
calculations were so conducted that the equations of motion were obtained 
from the harmonic conditions. In addition we obtained the gravitational 
potentials in explicit form (see Sections 82 and 83 of this book). 

The fact that harmonic coordinates are important in principle was already 
noted in our paper of 1939 and it was discussed more fully in the 1947 paper 
[35], where we pointed out the connection between the question of the unique- 
ness of the coordinate system and that of the preferred status of the Copernican 
heliocentric system. 

In our 1939 paper the equations of motion were obtained only in Newtonian 
approximation, but it was mentioned that the work of Petrova would be 
devoted to the second approximation. That work was indeed completed in 
1940 but published only in 1949 [37]. The equations of motion for a system of 
finite masses in second approximation obtained by Petrova were used by us 
in our 1941 paper [38] on the integrals of motion of the mass centre. 

In 1954 Kashkarov [39] used the same method to derive from the gravita- 
tional equations the equations of motion of rotating masses in Newtonian 
approximation. 

In 1950 Fichtenholz [40] reduced the equations of motion in second 
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approximation to Lagrangian form and hence derived their integrals, 

In his 1951 paper [41] Papapetrou, independently of Petrova, continued 
our 1939 work and derived from it the equations of motion for spheri- 
cally symmetric non-rotating masses in second approximation. However, in 
Papapetrou’s derivation there is an inconsistency of approach in that he 
chooses a priori a weighting function when averaging the equations of motion 
of a continuous medium over the region of each mass. The choice that Papa- 
petrou implicitly makes is justified only at the end of the calculation by a 
direct verification of the harmonic conditions. The same remark applies to a 
more recent paper by Meister [42]. 

In the domain of the theory ot masses in motion a number of new results 
are derived in this book. We shall not speak here of these or of our results 
in other domains of the theory of gravitation, referring the reader to the main 
text. We merely note that for the case of rotating masses of finite size we 
obtain not only the equations of motion themselves but also their integrals. 

In all work on the theory of the motion of masses—both the work of Einstein 
and his school and our own—the case of an “ insular ” distribution of masses is 
discussed, i.e. just the case relevant to our general conclusions on the possibility 
of introducing a harmonic coordinate svstem. Therefore the comparison of the 
two sets of papers gives a vivid illustration of the difference between Einstein’s 
point of view and ours on the question of coordinates and on the fundamental 
principles of the theory. Apart from these general matters the treatinent of 
some concrete questions connected with the statement of the general problem 
is also different; while in Einstein’s work the masses are thought of as points, 
we consider extended masses with a definite internal structure. Einstein’s 
interest in point masses appears to be connected with his aforementioned attempts 
to construct a theory of matter with particles as singular points of the field. 
Einstein’s view of the mass tensor as an “ undesirable element ”’ in the theory 
is connected with these same ideas. In contrast to this view, we consider the 
mass tensor as @ necessary elenicnt of the theory and attach special significance 
to its unique definition and to the examination of the conditions in which such 
a definition is possible. . 

Finally, on the question of the radiation of gravitational waves, we consider 
that although in ordinary astronomical conditions these waves do not play any 
noticeable part, they do nevertheless exist in principle. Therefore in a rigorous 
statement of the problem of the motion of masses we would have to include the 
wave part of the gravitational field; the problem is then no longer a problem 


of mechanics in the proper sense. 


CONCLUSION 


We shall attempt a brief exposition of our point of view concerning the theory 
of space, time and gravitation. 

In most ways our standpoint coincides with Einstein’s as usually expounded, 
but on some essential points it is different. 

Space and time must be considered in conjunction. They combine into a four- 
dimensional manifold which is uniform in the absence of gravitation and then 
constitutes Galilean space. Galilean space possesses a pseudo-Kuclidean 
metric. Its uniformity finds expression in the existence for it of the group of 
Lorentz transformations, which describe the transition from one inertial frame 
of reference to another. The property of uniformity is related to the physical 
principle of relativity, which asserts the existence of corresponding processes in 
any two inertial frames of reference. Theretore the theory of Galilean space 
based on the Lorentz transformations is called the Theory of Relativity. 

Both the property of uniformity and the wholc theory of Galilean space may 
also be formulated in arbitrary coordinates. However, the privileged character 
of Galilean coordinates manifests itself in the particular simplicity (linearity) of 
the transformations that relate inertial coordinate systems, that class of 
systems within which the physical principle of relativity holds. 

The physical basis of the theory of gravitation is the law of the equality of 
inertial and gravitational mass and its mathematical basis is the hypothesis 
that space and time possess a Riemannian metric, which is not given in advance 
and may itself depend on the physical processes occurring in the space (on the 
position and motion of masses). Owing to the fact that the description of the 
gravitational field does not involve the introduction of any quantities other 
than the metric tensor, Einstein’s gravitational theory is a realization of the 
idea of the unity of metric and gravitation. The Riemannian geometry used in 
the theory of gravitation studies the local properties of space and, in the general 
case, tells us nothing of the properties of space as a whole. However, in the 
theory of gravitation local treatment is insufficient, since the field cquations 
are partial differential equations, the solution of which depends decisively on 
the boundary conditions. In gravitational theory, thereforc, one has inevitably 
to introduce some kind of hypothesis concerning the properties of space as a 
whole. The simplest and at the same time most important of these hypotheses 
is the assumption that, at infinity, space is Galilean. Subject to this assumption 
one can impose on the gravitational potentials such supplementary conditions 
that the coordinate system is determined uniquely apart from a Lorentz trans- 
formation. This coordinate system, which we call harmonic, stands in strong 
analogy with the ordinary inertial coordinate system. 


The physical principle of relativity, understood as the statement that there 
exist corresponding processes in frames of reference of a certain class, does not 
hold in the general casc of Einstein space, let alone the “ General Principle of 
Relativity ” within the class of arbitrary coordinate systems. However, within 
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the class of harmonic coordinate systems a principle of relativity is valid which 
is fully analogous to the principle that holds in Galilean space. In this case the 
notion of “ corresponding processes ”’ must include the distribution and motion 
of the masses that determine the metric. 

The physical principle of relativity is connected, directly or indirectly, with 

the uniformity of spacc: either with full uniformity (Galilean space), or with 
uniformity at infinity (space permitting the introduction of harmonic co- 
ordinates), or else fnally with local uniformity; in the last case the principle 
holds only in the infinitely small. One can achieve local uniformity, i.e. uni- 
formity in the neighbourhood of a single point of space, in a geodesic coordinate 
system (the principle of local equivalence between acceleration and gravitation). 
The principle of equivalence thus follows from the Reimannian nature of the 
metric, but it is already contained in Newtonian mechanics and is closcly 
related to the law of equality of inertial and gravitational mass. 
From the point of view of the theory of gravitation the law of the equality of 
inertial and gravitational mass is valid with all the accuracy with which it is 
possible to define independently the inertial and gravitational properties of mass. 
This law is not of local character and is therefore more useful as a basis for 
deriving Einstein’s gravitational equations than the principle of equivalence, 
which requires the use of a frame of reference in accelerated motion—a concept 
for which there is no satisfactory physical definition. 

Owing to the local nature of the principle of equivalence, it is not a sufficient 
basis for the statement that fields of acceleration and gravitation are indistin- 
guishable in finite regions of space, and even less for the statement that there 
exists a “ General Principle of Relativity”. Such a conclusion from the local 
to the general is as inadmissable as would be, for instance, the conclusion that 
all analytic functions are indistinguishable because they all behave as if they 
were linear, in the infinitely small. 

Since the physical principle of relativity is related to uniformity and since in 
the theory of gravitation the uniformity is confined either to the infinitesimal 
or to infinitely remote regions of space, but does not hold in finite regions and 
at finite distances from masses, the theory of gravitation can evidently not be 
looked upon as a generalization of the concept of relativity. On the contrary, 
the theory limits that concept strongly : one can even say that the principle of 
relativity does not play any essential role at all in the theory of gravitation. 
It is therefore incorrect te call Einstein’s theory of gravitation a “ General 
Theory of Relativity ” all the more since “ The General Principle of Relativity ” 
is impossible under any physical conditions. 

The general covariance of equations has quite a different meaning from: the 
physical principle of relativity; it is merely a formal property of the equations 
which allows one to write them down without prejudging the question of what 
coordinate system to use. The solution of equations written in generally Co- 
variant form involves four arbitrary functions; but the indeterminacy arising 
from this has no fundamental importance and does not express any kind of 
“ general relativity » From a practical point of view such an indeterminacy 
even represents something of a disadvantage. It seems to us that it is preferable 
to have a mathematical statement of the problem in which uniqueness of the 
solution may be achieved (naturally to the same extent as the essence of the 
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problem permits). 

In our statement of the problem of an isolated system of masses the unique- 
ness of the solution is achieved by introducing four supplementary equations, 
the harmonic conditions, and also of boundary conditions, and by passing 
from a local description to a description of space as a whole. In this way the 
uniqueness of the solution is achieved in all the problems considered : in the 
problems of the equations of motion, of radiation, of the form of the rigorous, 
the approximate and the asymptotic solution of the gravitational equations 
and so forth. 

In the case of an isolated system of bodies the question of the coordinate 
system is answered in the same way as in the absence of gravitation : there 
exists a preferred system of coordinates (Galilean or harmonic) but it is also 
possible to use any other coordinate system. As in the Galilean case, the privi- 
leged character of harmonic coordinates shows itself in the linearity of the trans- 
formations which connect the coordinate systems within the class in which the 
physical principle of relativity holds. This property of harmonic coordinates 
distinguishes them physically from other coordinate sytems; in particular it 
is this property which allows one, as in the Galilean case, to give preference to the 
heliocentric theory of the Solar System over the geocentric theory. 

The existence of a privileged coordinate system is in the first place important 
as a matter of principle, in that it reflects objective properties of space. In 
addition, since this coordinate system: is of a standard character, it allows one 
to establish the geometrical meaning of other coordinates and to make com- 
parisons between solutions found in other coordinates. Finally, harmonic 
coordinates have great practical significance, because calculations can be 
simplified enormously by their use. 

When discussing regions of space that include many galaxies the notion of 
an isolated system of masses becomes inapplicable and the properties of space 
as a whole must be formulated differently (Friedmann-Lobachevsky space). 


APPENDIX A 


ON THE DERIVATION OF THE 
LORENTZ TRANSFORMATIONS 


In Section 8 the equations were derived which must be satisfied by the 
transformation function 


My =f A255 Lys a) (¢ = 0, 1, 2, 3) (A.01) 


in order to ensure that the following two conditions are fulfilled :— 

(a) To a uniform rectilinear motion in the coordinates (x;) there must 
correspond a motion of the same nature in the coordinates (2;) ; 

(b) To a uniform rectilinear motion with light-velocity in the coordi- 
nates (2;) there must correspond a motion of the same nature in the 
coordinates (z;). 

Condition (b) is equivalent to the condition : 

(b’) To the wave front equation 
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in the coordinates (x;), there must correspond just such an equation in 


the coordinates (z;). 
It was shown in Section 8 that functions satisfying condition (a) must be 


solutions of the system of equations 
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whereas functions satisfying (b) or (b’) must be solutions of the system 
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A.04) we shall derive another equation similar in form to (A.03). 
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We now introduce the quantities I's, defined by the relation 


3 
af, Of 
—_—__—— ——_- = xe T3 (A.07) 
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Equation (A.06) may then be written as 
eTh, belt, = 2omexdxi (A.08) 


To find the quantities [%,, in terms of the 9m, we write down two equations 
obtained from (A.08) by cyclic permutation of the indices k, 1, m: 


emi, + ely = 2pxe131m (A.09) 
xl, + emi = 2p1emSmk (A.10) 


Taking their sum, subtracting the original equation (A.08) and using the 
symmetry of I'L, with respect to the lower indices we obtain, after multiplica- 
tion with en, 

Pe a «01m =f poem = CmPmeKdk! (A.11) 


Now, equations (A.07) may be solved with respect to the second derivatives 
of fy. Using (A.04) we get 


ey tg (A.12) 
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Inserting the value of ['% from (A.11) we obtain finally 
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Such equations for the transformation functions (A.01) follow from condition 
(b) or (b’) that is concerned with motion with the speed of light. On the other 
hand, condition (a) concerning the uniformity of motion leads to equations 
(A.03), and these can also be written in the form (A.12) if we put 


Vt, = bedim + Videm (A.14) 


We have seen in Section 8 that if both conditions (a) and (b) (or both (a) 
and (b’)) are imposed simultaneously, all second derivatives of the functions 
fi are zero, the quantity > is constant (and can be put equal to unity), and the 
transformation in question reduces to a Lorentz transformation. 

We must now investigate the consequences of condition (a) taken separately 
and of condition (b) or (b’) taken separately. 

Equations of the form of (A.12) have been investigated in detail in Section 42, 
where a necessary and sufficient condition for their complete integrability was 
given. This condition has the form 

az am 
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It is deducible from the requirement that the various expressions for third 
derivatives of the f; obtainable by differentiation of (A.12) should all agree. 
Inserting the expression (A.14) into (A.15) we get 


n 
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Here we put k 4m;1 4m; but n = mand get 
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Since the value of the index m in (A.16) is arbitrary the restriction to k 4m 
and I + m is inessential and equation (A.17) must hold for all values of k and lL. 
Evidently this equation is also sufficient for (A.16) to be satisfied, for it follows 
from it that 


Me _ ‘(A.18) 


and the first term in (A.16) also becomes zero. 
We now insert the expression (A.11) into (A.15). In consequence of (A.05) 
we may put 


Op, as a9, = (A.19) 


Evaluating the left-hand side of (A.15) we obtain after multiplication by em 
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Hence it is easy to deduce that necessarily 

®,, = 0 (A.23) 


For, if we multiply the left-hand side of (A.22) by —ém, put m =n and sum 
over m we get 
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Multiplying the left-hand side of (A.24) by ex, putting k =1 and summing over 
k we get, after division by 6: 
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Thus the condition for the integrability of (A.03) is (A.17) and the condition of 
integrability of (A.18) is (A.23). As a result of (A.18) and (A.19) we may put 


wos, ong (A.26) 
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where } and @ are unknown functions (according to (A.05) we can take 
¢ = log \/a). Then the conditions of integrability become 


OE nck ONY, (A.27) 
Ox, Ox, 8X, On, 
ao ap dp 1 2 ( ad 2 
and —+— ——~— + -6,8 e.{—} =9 (A.28) 
x, 02, O82, 0%, 2 ae 2 "\ an, 
If here we put ev = Ww; = —logw (A.29) 
and also e? =U; p= —logu (A.30) 


the new unknown functions are seen to satisfy the equations 
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Equation (A.31) shows that w is a linear function of the coordinates 


w = WO) + Wey + WL, + Wel, + Wy (A.33) 
and that therefore 
w 
es (A.34) 
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where the wz are constants. Inserting these values of the | into the equation 
(A.03) for f; we can write the latter as 


Oa, Ox, 


(A.35) 


whence it follows that the quantities wf, are linear functions. This means 
that the four functions fy, f;, f, and f, are ratios of linear functions, all with the 
same denominator w. 

We thus arrive at the following conclusion. The most general form of the 
transformation (A.01) which satisfies condition (a) is a transformation involving 
linear fractions, all with the same denominator. In the particular case when the 
denominator reduces to a constant the linear fractions become linear functions. 
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We now turn to the discussion of equation (A.32) which follows from condition 
(b) or (b’). In the first place.we note that for & + 1 its right-hand side is zero. 
Therefore the second derivatives of wu with respect to two different variables 
vanish and the function u is additively composed of single variable functions. 
On the other hand, it follows from (A.32) that the double derivatives of u 
with respect to a single variable are all equal, apart from a sign 


074 O7u O2u O7u 
— oe — 


eo ass = €3 = (A.36) 
° aa? da? aa? aa? 

But each of the quantities of (A.36) can either be a constant or depend on 
one variable only (that with respect to which the differentiation is performed), 
they are therefore all equal to the same constant. We call this constant 2C 
and then have 
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It follows from (A.37) that u is of the form 
3 3 
u=C dem —2 > e025, + B (A.39) 
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where the a, and Bare constants. Equation (A.38) only gives a relation between 
the constants namely 


BC = Se,02 (A.40) 
r=0 


According to (A.05) and (A.30) the factor in (A.04) is inversely proportional 
to u®, We can put 


StH = os (A.41) 
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Without restriction of generality we can assume that at the origin, i.e. for 
Dy = Ty = Tg = 23 = 0, We have:u =1. This can always be achieved by 
changing the scale of the f; or the xx. With this condition we get 
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We replace the quantities 9, in (A.13) by their values 
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If u is of the form (A.43) the conditions of integrability of the system (A.45) 
are satisfied, and the functions are completely determined by the values at the 
origin of the functions themselves and their first derivatives. Let these values be 


(filo = 5 (#) — CKOLK (A.46) 
Since at the origin u = 1, the equation (A.41) gives 
2 Ce = Ser (A.47) 
We now put rs 
f= 0 + 3 ertgd? (A.48) 


By virtue of (A.47) these equations can be solved for the f* which are then 
expressed as linear functions of the f;. Evidently the {? satisfy the same system 
of differential equations as the /; ; for the system (A.45) it follows from the linear- 
ity of the equations, for the system (A.41) it follows from (A.47). But the 
initial conditions for the f* are of the form 


(f*)o = 0; (2 ‘) = = Srx (A.49) 


OLk 


Thus if we find a solution of the system (A.43) with initial conditions of the 
particular form (A.49) equation (A.48) gives us the most general solution of the 
system. 

The system (A.45) may be written in the form 


O(uf*) Ou of* 
Ox, Ox, = SOT ae 2a, 2, ax, + 8 > mim az, em 
and, using (A.37) we get 
e?(uf*) of 
Ox, an, by, ~ Bu (20% % > emmy a) on) 
m=0 
Here we put 
F; 
ft = = (A.52) 
By use of (A.38) we obtain 
O°F; 1 oF; 
2, a, = CH :(- 2CF; + Se nin 5 =) (A.53) 


m=0 
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Arguing in the same way as in the discussion of (A.32) we get 


O2F, 
| an, On, => 2C 6,841 (A.54) 
with | 
3 
= OF, 


It is easy to see from (A.52) that the initial conditions (A.49) for the Ji lead to 
conditions of the same form for the F;, Equation (A.54) together with the 
initial conditions gives 
3 
| C2 eyre + a, (A.56) 
=0 


Insertion of (A.56) into (A.55) allows us to determine the values of the constants 
Cy, namely 


Ci (A.57) 


With these values for the constants equation (A.55) becomes an identity. 
Therefore we have 
3 
F, = 2,—a,> eae (A.58) 
k=0 
and 


3 
Ty — Oy 2 Cap 
- k=0 
ft= : : : (A.59) 
1—2 deo, + Dea? > eu? 
k=0 k=0 1=0 


We arrive at the following final conclusion: 
The form of the most general transformation (A.01) satisfying condition 
(b) or (b’) is obtained from the transformation 
3 
Uy — Oy D ey 
Te a a a NO (A.60) 


2 2 
1] —2 dea,a, + > eof d. ea 
k=0 k=0 l=0 


together with the transformation 
(A.61) 


3 
ble * 
= 4, +2 Cy Ny 
=0 


where the coefficients ay, satisfy condition (A.47). In addition, a change of 
scale is possible. 

Transformation (A.61) is the usual Lorentz transformation, and (A.60) 
bears the name of Mébius transformation. The latter has many remarkable 
properties which, however, we shall not dwell on. 

For us, it is important to note that the requirement that the form of the 
wave front equations be conserved does not by itself lead to the Lorentz trans- 
formations since it also allows Mobius transformations. To remove the latter 


A3 
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one can make the additional demand that finite values of the initial coordinates 
should lead to finite values of the transformed ones. This additional require- 
ment is satisfied only if all the constants a, in the Mobius transformation are 
zero, making the transformation the identity. Instead of this additional require- 
ment one can make another, namely, the condition that uniform rectilinear 
motion be conserved (condition (a)) ; this is how we proceeded in the text. 
Hither of these additional requirements leads uniquely to the Lorentz trans- 
formation, apart from a possible change of scale. 

It is also of importance that the requirement that the coordinates remain 
finite refers to space-time as a whole, whereas the conservation condition for 
uniform straight line motion is strictly local. 


tT The discussion and conclusions of this appendix follow very closely those of Wey] [7]. 


APPENDIX B 


PROOF OF THE UNIQUENESS OF THE ENERGY 
MOMENTUM TENSOR OF THE 
ELECTROMAGNETIC FIELD 


For the electromagnetic field in vacuum the state functions are the compo- 
nents of the field 


Ey, E,,E;; Hy, HH, (B.01) 
and the first order equations of motion satisfied by them (field equations) are 


Maxwell’s equations 


1 dE 1 dH 
curl H —-— = 0, cur] E + ~ — =- 0 (B.02) 
c a c at 


together with the supplementary conditions 
divE=0, divH=0 (B.03) 


As in Section 23 we shall characterize an infinitesimal Lorentz transformation 
by the quantities 


V; Vy Ve 
O19 = —; Os9 = —; O39 = _; 

c Cc Cc (B.04) 
Oe3 = Oz; O31 = Oy; Oy, = Wz 


Using three-dimensional notation we can write down the change in a field 
component in an infinitesimal Lorentz transformation as 


3E =~ [V x H] + [w x E] 
(B.05) 
SH = — L(V x E] +o x A 


Coniparison of these equations with the general equation (31*.04) gives us 
the coefficients (31*.09) and at the same time the operators X!™. We get 
7] 0 7] 7] 
gp, — Hogg, -B +E,;-—  (B.06) 


OE, °2H, oH, 
and two similar equations obtained from (B.06) by cyclic interchange of the 
indices 1, 2 and 3. Introducing the antisymmetric set of quantities 


X00 xo _y 


€ix, = 4-1, if (2, &, 1) are an even permutation of (1, 2, 3), 
eiet = —1, if (2, k, 1) are an odd permutation of (1, 2, 3), (B.07) 
eixy = 0, if there are equal suffixes among the (2, f, 1) 
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Comparing (B.14) with (B.15) we are led to the followmg relations. In the 
first place the coefficients of 0F;/0a for i = 1, 2, 3 must be equal. This gives 


@E, OE, @&, re) 
Similarly the coefficients of 0H;/éx; must be equal for i = 1, 2, 3. Hence 
OT™ i oT? _ oT3 


0H, 0H, 0H, ey 


Secondly, for « 4 k the coefficients of dE ,/éa; and 0H;,/éa; in (B.14) niust be- 
come zero, since such terms do not appear in (B.15). This gives us the relations 


oT eT~“® 
a ke — =0 B.18 
OB,  * oh 18) 
eT eT” 
aap = B.19 
OH + Ce1tk aE, ( ) 
Hence ifi 4k 
er = aT** 
ene Caer ees | B.20 
aE, | OE, a 
eT oTok 
—— += 0 B.21 
OH, * Oly re 
If (¢, k, 1) is some permutation of (1, 2, 3) we easily see from (B.20) that 
4 
Ri (B.22) 
OEE, 
and from (B.21) that 
e277 
eee (B.23) 
0H ,0H; 
Differentiating (B.18) with respect to H, and using (B.22) we get 
27120 
Se =0 (2 =: 1, 2, 3) (B.24) 
obo, 


The same equation can be obtained by differentiating (B.19) with respect to 


H, and using (B.23). 
Let us now form a system of equations for T°. According to (B.24) we have 
or 0) (k= 1,2,3) (B.25) 
oF 0H 


Putting « ='0 in (B.18) we get 
eT® oT”? 


a. Ce —_ (B.26) 
OE; re oH, 
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On the other hand, equation (B.24) for a = ¢ gives 


2708 
cae (B.27) 
GH ,0Hy 
Hence 
02T 00 
———— = 0 kl B.28 
5H, oH, (k # l) (B.28) 
Similarly we obtain 
Eel 0 kl (B.29) 
0E,0E, vee 
Putting « = 0 in (B.19) and replacing i by | we get 
oT” éT 
oT = tt ae (B.30) 


Differentiating (B.26) with respect to H), (B.30) with respect to £, and equating 
the results we obtain 
O27 00 O27 00 


GEE = HE (k # 1) (B.31) 


The system of equations we have obtained for T° is easy to solve. Since T° 
is a three-dimensional scalar, it can depend on the field components only through 
the quantities 

E=3}E, 4=(E-H), ¢=4H? (B.32) 
so that 
To = f (6, n, 9) (B.33) 


Indeed, this expression is a solution of the equation X* (74,) = 0, where X* is 
given by (B.10). Let us now use equation (B.25). 


We have 
QT af af 
eee 1 ae ih oes B.34 
OE x a a Hi ay, eo) 
and further 
e277 af ar ae arf of 
eee ee og 1 2 a 8) dh pees H? B.3 
oH,oH, ant aes 3 = & 9G, 1 te Gna (PP) 


As a consequence of (B.25) this expression must vanish for k = 1, 2, 3. 
Evidently one can always make a rotation of axes in such a way that, say 


for k = 1 the quantities H, and H, vanish simultaneously. Then equations 
(B.25) and (B.35) with & = 1 give 


of 

a 
and all second derivatives involving differentiation with respect to y also vanish. 
If one takes this nto account, the same equations for k # 1 give 


of 
a€ at 


=0 (B.36) 


=0 (B.37) 
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By virtue of (B.36) the equations (B.28) and (B.29) lead to 
o"f o"f 


Thus the quantity fin (B.33) is a linear function of E and C. Taking this into 
account, we get 


O27 00 rs) o2T 00 rs} 
——— x — x (B.39) 
OE: oF oH; ae 

and then equation (B.31) gives 


f of 
ER (B.40) 
Since f is a linear function, we get 
f=of&6 +0) +a (B.41) 
where « and d are constants and therefore 
To — 5 (E2 + H2) +> (B.42) 


Knowing T° we can find the remaining components of T#” from equations 
previously derived. Thus, insertion of the values 


oT °° eT 


= «E —- = oH B.43 
7) nr: Soa 
into the formula 
P= EXT) (B.44) 
(see (31*.18)) gives the energy flux 
T” = ofE x Hi: (B.45) 
(the Umov—Poynting vector). This quantity may be written in the form 
3 
T? = a > eipqlpHg (B.46) 
pq=l 


Inserting this expression into the formula 
Tie — (AX%X™ — $y)T (a, k = 1, 2, 3) (B.47) 
we obtain, after some manipulation 
Tk — a[45.4(E? + H?) —_ RiEx — A Hy) — Aix (2, k= 1, 2, 3) (B.48) 


Here we have used the relation 


3 
> eipsthgs = Stdpq — Siqdke0 (B.49) 


s=1 


It is not difficult to verify that the expressions found for the energy tensor 
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also satisfy all the other equations (B.16)-(B.19) and not only those which were 
used in their derivation. 

If one assumes that for E = 0 and H = 0 one should have 7" = 0 one gets 
2 = 0. If one interprets 7+’ as the energy-momentum tensor (and not the mass 
tensor), then, with the customary choice of units, one has « = 1/4. Thus the 
tensor J** is determined uniquely and the result agrees with the generally 
accepted formulae (33.23)—(33.25). 


APPENDIX C 


PROOF OF THE UNIQUENESS OF THE HYDRODYNAMIC 
MASS TENSOR 


In Section 32, when we were deriving the relativistic generalization of the 
hydrodynamic equations of an ideal fluid we started from a definite form of the 
mass tensor. We shall show here that the argument can be conducted in 
the reverse order: taking the equations of motion as one’s starting point one 
can derive the mass tensor from them. 

As functions of state one can take the pressure p and the invariant density 
o* (two scalars) and the four components vu‘ of the four-dimensional velocity 
vector. 


The quantities p and o* will be related by the equation of state and the u! 
will be connected by the identity 


3 
> ex(us)? = c? (C.01) 


In place of the pressure p it is convenient to take the dynamic pressure P, 
which is related to p and o* by the equation 


Pd 
p= | oP (C.02) 
* 
Fs p 
As a consequence of (32.27) we have 


pP="+ = (C.03) 


where IJ is the potential energy of unit mass. If f is any function of P, p* and 
the u? its variation in an infinitesimal Lorentz transformation will be 


sf WiKequ out C. 
Equating (C.04) to the general expression 


3 
Sf = ; > coaeX*#(f) (C.05) 
i, k=0 


a 


we get the following expression for the operators Atk: 
a 


a 
Xtk 2= euk or tae exut Duk (C.06) 
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Evidently the scalars are not involved in the formula for Sf and therefore also 
not in the expression for the operators X**. As for the partial derivatives with 
respect to the uf, it is immaterial in forming the operators X** whether one 
assumes all four quantities ué to be independent or whether one takes into 
account their relationship (C.01). 

We introduce the symmetrical system of quantities 


1 
atk — C1045 ne utuk (C.07) 
c 
. which possess the properties 
3 
> esasus = 0 (C.08) 
; s=0 
and we denote by w‘ the four-dimensional acceleration vector 
3 
dut 
i= piers 
w > ut (C.09) 


0 
Then the hydrodynamic equations may be written in the form 


P s|  aP 
( of 4 wt — » lesa = (C.10) 
k=0 . 
8 
é *,,K 
> a (Pre) = 0 (C.11) 
k=0 


In addition, it is necessary to take into account the relation 


Gus 
s_eses=_ 
Z Cg : 0 (C.12) 
which follows from (C.01). 


As a consequence of (C.12) we have the equation 


3 
> egursws — 0 (C.13) 
s=0 


and the relations (C.08) show that this equation is consistent with the equations 
of motion (C.10). 


We now write down the equations that express the fact that the divergence 
of the mass tensor is zero by virtue of the field equations. We have 


3 3 
aTtk dus aTtk ap = P\ aw aP 
—~=+)> = 4 —]} uk — — ask __ 
2 Out One > OP Oa 2 at (1 a Ory a oa 


8 0 b= 8, k=0 C 
3 
dp* oP 2 Cut 2 Cut 
+ xt > uk > *5 > tk 
> dP oxy : 8=0 ais 4 . a - OLE con 


Here the right-hand side represents the sum of the expressions (C.10), (C.11) 
and (0.12) multiplied by the Lagrangian multipliers dj, Af and wu! respectively. 
After introduction of the Lagrangian multipliers equation (C.14) can be looked 
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upon as an identity in tne derivatives with Tespect to the coordinates. Equating 
first the coefficients of us/@x, and then those of OP/0cy we obtain 


are fp 
Bue = 4S ( + 3) uk + rN o*S54 + uegus (C.15) 
OT tk 3 do* 
=p > are 4 nt a uk (C.16) 
s=0 


The Lagrange multipliers must be eliminated from these equations. Applying 
the relation (C.08) to the second equation, we get 


1 oT ts 
=0 
Inserting. this value into (C.16) we can write that equation (replacing & by j) in 
the form 
3 3 arts 
rig = aj 
Z ‘a >, °# aP (C.18) 


We now multiply both sides of (0.15) by a‘ and sum over s. On the right- 
hand side the factor multiplying y* becomes zero as a result of (C.08), so that 
the quantities u‘* are eliminated. The ¢ on the other hand enter only through 
the sum (C.18), which is known. Finally, for the d¢ one can insert their values 
from (C.17). As a result we obtain a relation which may be written as 


3 3 
OoTtk P oTis a oT 
_ | ak = i = 19 
>" ae + (1+) - od =| 0 (C19) 


This is the required system of equations for the tensor T?*, which follows from 
the condition that its divergence vanishes by virtue of the field equations. 

Now we must take into account that the set of quantities T?* form a tensor. 
This we can do by one of two methods. We can adjoin to (C.19) another system 
of differential equations, namely 


X15(Ttk) = Sir T8* — 404g 1 T* + exer 1 8 — CrOng l (C.20) 


where the operator X’* has the value given by (C.06). Alternatively, we can 
write down directly the most general expression for a tensor which is a function 
of the vector w* and of certain scalars. This expression has the form 


TH = Autuk + Beds, (C.21) 


where the quantities A and B depend only on the scalar o* (or equivalently 
only on P.) A dependence on the invariant of the vector w* need not be taken 
into account, because according to (C.01) that invariant is a constant. 

It is easy to verify that the expression (C.21) satisfies the system of equations 
C.20). It is a general solution of that system. 
7 can now insert the expressions (C.21) for 7* into (C.19). We first cal- 
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culate the internal sum in (C.19), using the relation (C.01). We get 


le ore (dd , 1 4B 
ee 
r=0 
Inserting (C.21) and (C.22) into (C.19) and using (C.08) we obtain an equation 
of the form 
3 
> ais(Mut8zx + Nuk8si) = 0 (C.23) 
3=0 
or 
Maskut + Nastuk = 0 (C.24) 
where we have put for brevity 
dA o* dB 
= A — p*¥— —-— .25 
te dp*  c® do* C2) 
P\ dB 
= 1+-—)— .26 
N 4 +( +a)5 (C.26) 


It is evident that the equation (C.24) can hold for all values ot 2, 7 and k only 
if M = 0 and N = 0. Thus the scalars A and B in the expression for the tensor 
T** must satisfy the equations 


Apt ee Sef (C.27) 


dB 
At (145) B=9 (C.28) 


Differentiating the second equation with respect to p* we obtain 


dA i: 1 dB 1 P\ d (dB 0 C29 
i sae t (1 +5) as (Gp) = ai 
and then the first equation gives 
fa d (dB 
A 1+-—) et — [(—] = 
+(\+a) ale) -° “ 
Eliminating A from (C.28) and (C.30) we get 
d dB 
, eS, ieee 
(c ie* 7p 0 (C.31) 
Hence 
dB 2 
qP => aoe (C.32) 


where a is a-constant. But owing to the rélation (C.02) between the dynamical 
pressure P and the ordinary pressure p we have 


e*dP = dp (C.33) 
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Therefore equation (C.32) can be written as 


dB +adp=0 (C.34) 
and its solution gives us 


where A is a new constant. The quantity 4 may be determined from (C.28) 
and (0.32); we get 


P 
A =ao* (2 a 3) (0.36) 
and the expression for the mass tensor takes on the form 
P 
Ttk —@ [> (3 + 4 ulyk — pedi + redex (C.37) 
If we assume that at infinity, where p* = 0 and p = 0, we have 7#* = 0, then 
> = 0. With the usual choice of units one has to put a = 1 in the energy tensor 
and a = l/c? in the mass tensor. This determines both constants and the 
formula for the ass tensor assumes the following final form: 


c®?Ttk — o* (2 + 5) utuk — pedix (C.38) 
oe 


As a consequence of (C.03) this formula agrees with (32.28). 


APPENDIX D 


THE TRANSFORMATION OF THE 
EINSTEIN TENSOR 


(a) The Transformation of the Second Rank Curvature Tensort 

We begin with the transformation of the covariant second rank curvature 
tensor #,, and show that in, the expression for this tensor, it is possible to 
separate off terms containing the d’Alembert operator applied to that com- 
ponent g,,, of the metric tensor which has the same suffixes. 


By definition we have 
Ry = 9 Ras av (D.01) 


where, by (44.08), the fourth rank curvature tensor is 


1/ 0%9, 0°9,5 8g 8°9.,5 ) 
pas Yt Re ee SE PE ee Pe 4g Pes 
Ha By le Or, 0,00, On, 0%, Ox, OL, Foor ua va T Too iw a 

(D.02) 


(this equation differs from (44.08) only by the naming of the indices), Therefore 
1 0°9,, 1 09,8 1 0°69. 


R= qs < gah B 
ee go On, OLy +3 dx, Oa, go Ox, OL, 
1 279.15 . 
= g* a, oe, = 7 "gol Pol on + |e (D.03) 
For brevity we put 

Tr, =g,,I” (D.04) 

where in agreement with the definition (41.15) 
Te = "Pe, (D.05) 


In (D.03) the first term is already in the form of the d’Alembert operator 
applied to g,,. The remaining terms may be so transformed that the second 
derivatives of components of the metric tensor only appear through the first 
derivatives of the I',. To perform this transformation we need the relations 


Q 
ia, 8 (-9) = 9 3 (D.06) 
ary age* 
iC) =_l 
oa, 9 Be, (D.07) 
and 
1 2 
[t= — —g)-gt8 D. 
Toa VO (D.08) 


t See also the papers by de Donder [16] and by Lanczos [17]. 
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which were derived in Section 41. From these telations it follows that 


Gay 1 Alog(—g) 


I, = g6 
v= GF on, 2 da, (D.09) 
Differentiating (D.06) with respect to x,, we get 
a? 8? log (— ag? a 
gt —Ja8_ log (=9) _ 29° aug (D.10) 
dz, On, ax, Ox, an, OG, 
We note that owing to the symmetry of (D.10) in p and v we have 
OG 90g, _ 29° ag 
On, Oz, Ox, On, ay 
Differentiating (D.09) with respect to z, we get 
re Pe ee a 
02, OL, 2 dn, du, On, 0, Og Del) 
In the last term on the right we may replace a,,/22, by 
1 (a9 09. ) 1 2,5 
= Ne ee a Oe D.13 
(ate Mas +5 ez, ee 


where I’, ,g, is the usual Christoffel symbol of the first kind (38.30). Equation 
(D.12) then appears as 


seit, SO Dirt gs 6 ON Oe gs ii 
Ou, Ot, 2 dn, aa, Oz, 2 ax, Ox, —  ** an, 
(D.14) 


We interchange the indices » and v and also the summation indices « and 6 
on the left. Then we can write 
2 = B B 
dn, Oz, 2 dn, ox, dz, 2 On, Ou, ax, 
(D.15) 
Thus the sum of the expressions (D.10), (D.14) and (D.15), half of which enters 
(D.03), is equal to 


( 29,0 8°94 8"9ua 


ox, OL, Oz, Ox 
c} ic} 


ae ot ae 
dz, On, v a6 az, BoB ez, 


(as a result of (D.11) the remaining terms cancel). 

We introduce the notation 
rr, @& 
—i (Su, Sr) _ pp (D.17) 

(5 + Ox, Ke 

The quantities.I’,, are formed in the same way as the symmetrized covariant 
derivatives of a vector, although I’, is not a vector. As a result of (D.16) the 
expression (D.03) for A, then becomes 


ag? 

1 9 99 a Op get ar 
=: ead = = a 0, uB> va 

Ruy — 5 Sf on, : tae 5 rl. af on, + 9 p ap an, uB 


(D.18) 
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To simplify the calculations to be performed on the terms involving first deriva- 
tives we shall use not only the notation I',,, and Ty, for the usual Christoffel 
symbols but also the notation 


Teh = gregh] (D.19) 
and 
Pina = gegheTs, = gieghegT, (D.20) 
for the corresponding quantities with raised indices. We note that [4° is 
given by ; ; ee 
be 
Tv 8 = alo oy — gee eg" — g? <) (D.21) 
2 Ox, ox, ox, 
We consider the expression 
Gua You ; 
A, = 9°"? Big ae mee ja beer bie (D.22) 


whose last term coincides with the last term in (D.18). We write this expression 
in the form 


of Guo ov 
A, = 99° (es pet Tal | (D.23) 
We insert 
iro 
Big el WER en oo) De (D.24) 
and then use the equation 
29. 
a, => Pigs — oe (D.25) 
Then we get 
a og Vv 
A, = 99° (eee op Lee Aa (D.26) 


Using the notation of (D.19) and relabelling, we may put 


29 5, 
Ay = TST, ug +18 (D.27) 
Since the coefficient [2° is symmetric in « and B we can replace the factor 
multiplying it by its symmetric part, which is equal to }29,,/0”,,. For the same 
reason we can replace @9,,/0z, by (D.13). Doing this we get 


rr ee 
A,,, = 5 Tee 228 4 — pop a8 4 pas 
wi 2” Sn, tot ax, au beng Wore (D.28) 
But, as is easy to verify, 
a i) 
ret 8 op a (D.29) 
ox, On, 
Therefore 
1 ag ag® 
w= ~ 9h v08 Be 5 Paws Go tT an (D.30) 
ub Vv 


Equating the expressions (D.22) and (D.30) for the A zy We obtain the relation 
a ea uo & 
= — = ey eB os —. Ta aB~pa a Yoy 
aaa ar, 2 was ox, a rs ner, Egat oe a By oe 


(D.31) 
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which enables us to write the expression for R,,, in the following final form 
0*g 0g, 2 
R. =4q8 —™. — _ nxheo Gus Gov | ne 
Tr) ag , ry gg? pads 2c, + re l’, 46 (D.32) 


From this it is easy to obtain an equation for the contravariant components 
of the second order curvature tensor. 


By differentiating relations of the form (D.07) it is simple to derive the 
equation 


07g 0°g 29,5 G45 2 
SI S95 uy 4 geo Guo Ive 4 uo Fre 
IuoIve Ox, OL, ox, ar, +e (se OX, ah: Oy *) eee 
Using this we can write R,,, as 
07g"? 
Ruy = — F909" Sa Oi at is Bea (D.34) 


ox, OL, 


If we then raise the indices p and v we get the following simple expression for 
RY: 


d%gitv 
Ry = — 19% aE — Te 4 Pea8py, (D.35) 
The quantity I’ is obtained from I’, by raising the indices according +o 
Tey = gteg Tl (1.36) 
and may he expressed directly in terms of the I* defined by (D.08) as follows: 
TY = : (oe ~ + 9% 7 _ a r*) (D.37) 


(b) Transformation of the Invariant 
We now form the invariant of the curvature tensor : 


R = g,, he (D.38) 
We have 
agit¥ 
= —}g% me ee Bend ba D.39 
$9 Tuy an, ax, F vo af ( ) 
where I’ denotes the quantity 
rz ud” (D.40) 
Using the relation 
0108 (29) ny, (D.41) 
Ox, OL, 
we obtain from (D.37) 
az =: 1 @log (—9) 9 
a tee See (D.42) 
is ox, z 3 ot, 
and by virtue of (41.16) 
Ps 1 a%{4/(—9)-97"} (D.43) 


- V/(—9) on, 02g 
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Differentiating equation (D.41) with respect to z, we obtain, in analogy with 
(D.10) 


; dig _ BP log (—9) _ 29" Buy (D.44) 
WY 0X, OL, 02, OL, az, Oz 
Inserting (D.44) into (D.39) we get 
pa hgs Z8(—9 _p prem, +g Be (D4) 
2 02, OLg On, IW, 


We see that the second derivatives of the metric tensor enter the expression 
for R only through the second derivatives of log (—g) and through I. The 
terms involving first derivatives can be transformed with the aid of the relation 


ag’ ag ag7* 
Tespy p 9” uy _ 1p (D.46 
vag T a On, Ot, 2 28 On 


v 


which is easily deducible from the wie 
8 
pes 1% I 59 (9 a9 4 gon (D.47) 
We now get the result 


1 4g 0? log (—9) ag*® 
R= _ ql es _T Ii (D.48) 
ao a, ax, +35 os, 
This expression can be written in the form 
2 —_ 
OL, IL, Cz, 
B ~~ 
where yA pe aA (D.50) 
on, on, 
Remembering the equation 
oy ey 
O 8 —Tts 
4 9 a oa, ax, OL, aL, On, ee 
which gives the d’Alembertian of any function y, we may write 
R=OlogV(—g) -T—L (D.52) 


Of course the quantity y =: log 1/(—g) is not a scalar, but formally the operator 
of (D.51) may be applied to it. We note that both the first and the second 
term in (D.52) represent a sum of derivatives of some quantities with respect 
to coordinates, divided by »/(—g). This fact is of importance in the formulation 
of a variational principle for Einstein’s equations, when the quantity D defined 
by (D.50) plays the part of the Lagrangian (Section 60). 

Apart from (D.50) there are several other ways of writing the Lagrangian L. 
We quote the following 


a = 
L= 41%, 28 Gao _ pe 9 log v( 9) (D.53) 
on, Ox, : 
and also 
Ms Goal oa. Fe ca) (D.54) 


the last form is most commonly encountered in the literature. 
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(c) Transformation of the Einstein Tensor 
The foregoing equations ‘enable us to write down an expression for the 
divergence-free Einstein tensor 
Guy = Rev — 1gR (D.55) 


We shall see that the second derivatives of the metric tensor enter G#’ only 
through second derivatives of the quantity ./(— g)-g"’ and through first deriva- 
tives of I’. It is, therefore, convenient to introduce a special symbol for +/(— g) 
times the contravariant components of the metric tensor. 
We put 

gt’ = /(— g) “gh (D.56) 
Then equation (41.16) may be written as 
_ 1 age 
For the following it is convenient to transform all equations in such a way 


that they involve only derivatives of the g*’. In performing the transformation 
we shall encounter derivatives of the quantity 


v 


(D.57) 


y = log /(—9) (D.58) 
which we shall write as follows : 
_ dlog+/(—9) ) D.59 
Yau aa, (D.59) 
According to (41.07) we have 
%=TD (D.60) 


For the quantities obtained from the y, by raising indices we introduce the 
notation 


y* = g*°y, (D.61) 
just as for a tensor (although of course the #* are not a vector). We have also 
yt = Te (D.62) 


where T'@* has the meaning defined by (D.19). We shall denote the second 
derivatives of y by y ag: 
7 lo — : 
ja Bev (D.63) 
OX, OX, 
According to (D.2]) the r4.28 are bilinear functions of the components 9!” 
and of their first derivatives. Inserting the expression for g*” in terms of g*” 


into that relation, we obtain 


Te. 28 = [4,98 + Au, 28 (D.64) 
1 ag"* age? “6") 
[Iu 28 = —_{ g% 4 gee — que 2 (D.65) 
where 5 -(3 Oa, 8 az, 8 ax, 
and AM 28 = 2 (yagh® + yPgne — yeg*®) (D.66) 
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We shall denote the corresponding quantities with lower indices by II, and 


A4,. 
We evaluate the determinant of the g”’ : 


Det g* = {/(— g)}* Det g” = gr ; =9 (D.67) 
Thus the determinant of the g”” is equal to the determinant of the g,, : 


Det g’” = Det gy, =9 (D.68) 


From equation (D.66) we obtain 


Jag” «8 — — yt (D.69) 
and since Jag * = Te (D.70) 
it follows that Jogi» #8 = Te +. yt (D.71) 


The last expression is also equal to 


hoe a (D.72) 


We go on to the transformation of the Einstein tensor 
Gey — Re —_ 4g’’R (D.55) 
Our starting point is from the equations 


~ 42 — —TH 47, s6py, (D.35) 
: 


and also equation (D.49) for R, which we write in the form 
R= gy, —I*y, -T —L (D.73) 
The second derivative of g¥” is 


a% guy _ = ). ( 0g 


ox, a2g ox, OL, 


+ yy 2 -+ y, = + Yas + vasa") 


(D.74) 
and therefore 


ae 


e% thy LV 
GAs = = /(- 9-(9° I + a a + 997 Y + gy. 
a ic) 4 


(D.75) 
On the other hand we have 


he 
ro — gen = —4(92 2 5 gery, 
+ 39*( ry, +P+42)—-T’+T,8E,  (D.76) 


Appendix D 429 


By comparing the last two equations we see that apart from the terms in T° 


and I“’ the same combitations of second derivatives occur in both. The 
calculation gives 


1 2a 


Re — JgR = ——____g* wy Pry +P +L 
ag Wen! da, Oa, + $9*"(ygy* + Poy, +P +2) 


TH! 4 er 4+P98E,  (D.77) 


a 


As always, the transformation of the terms involving first derivatives is the 
most complicated. We have 


Te, o6fy, = TT Ty, + AY 21Ty, + AY TH, + AY 8A, (D.78) 
Using (D.71) we get 
AM 2O1Ty, + AY ITE, = y,(I1s¥? + T1¥.%) — 3(yAT® + y'T*) — yy” 


(D.79) 
and as a result of 
TI 4 + 1% = a"? < Peg Se (D.80) 
we have 
ag” 
AM28ITy, + AY TI, = — y= — gy y* — Hy +9) — ty 
: (D.81) 
Further 
At BAx, = dy4y’ + b9yy* (D.82) 
Hence 
a ge Vv v 
TH o8Dy, = TI Plog — a — tgyy* — byty’ — 3" + yl) 
D.83) 


These relations allow us to rewrite (D.77) as 


uv P 1 gr a? gi 
RS =i = Q/(—9)” 2x, Oy 


+ 4gL + gery, +1) -T’ — kyl’ +yT*)  (D.84) 


+ Ti+: IT, — by¢y” 


Here we put 
Bey = Tey + dy4TY + oT) (D.85) 
and B= 4,0 =0 +My, (D6) 


and we express the g** in terms of the q> in the term involving the da’ Alember- 
tian. Then the eae for the Einstein tensor becomes 


a ee le 
: (D.87) 
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Since, according to (D.68), the determinant g can be expressed directly in terms 
of the g"’ we can take it that in (D.87) all terms except L are expressed in terms 
of the g* and their derivatives. It remains for us to express the Lagrangian 
LE also in terms of the sanie quantities. 

By the definition (D.50) we have 


agr* 
etic LY Wie D.88 
L 31%, ax, Py, ( ) 
Here we insert 
Tig = Wig + Aas (D.89) 
where, by (D.66) 
Ars = 4(y.83 + 98s — ¥°9ap) (D.90) 
Using the relation 
soe . 
a8 ar =? (D.91) 
which is easy to deduce from (D.90), we get 
B 
L=-—} my, —4$Tty, (D.92) 
Here we express ag*°/dz, in terms of 
ag* ag*® 
—_ = —_gq):{[{—~_— a8, 
B= Vi-2)-(& +9%,) (0.93) 
and use equation (D.71), which may be written in the form 
gly, = TY + y" (D.94) 
This gives, finally 
1 ag 
= —-—— II, — + x D.95 
24/(—9) cy fz, Byy ( ) 


Now only the g® remain under the sign of differentiation. 
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THE CHARACTERISTICS OF THE GENERALIZED 
D’ALEMBERT EQUATION 


The generalized wave equation, or d’Alembert equation, has the form 


Oy =0 (E.01) 
where (|) is the expression 
1 a ay 
— = ~q2b . 
Chip Wea av (—9)-9 =) (E.02) 


in which the quantities g** and g have their usual significance. 


Cauchy’s problem for the cquation (E.01) consists in determining the function 
tv, if on some hypersurface 


(Zo, 1, Ly, Z3) = const. (E.03) 


the values of U and db/@rg are given. We assume that the cquation of the 
hypersurface (E.03) can be solved for x) and that therefore 


— +0 (.04) 


We are interested in the possibility of solving Cauchy’s problem in some region 
sufficiently near to the hypersurface (E.03). To evaluate the function +) in this 
region one has to be able to calculate the derivatives of { at any point of the 
hypersurface. It is easy to see that the first derivatives can be calculated 
directly from the initial conditions. For the calculation of second derivatives, 
however, one necessarily has to use the wave equation. The possibility of 
determining the second derivatives will then depend on the form of the hyper- 
surface to which the initial conditions refer. If the hypersurface is such that 
initial conditions relative to it do not determine the values of the second 
derivatives it is called a characteristic. A characteristic hypersurface has the 
property that on it discontinuities of the second derivatives are possible. There- 
fore the moving surface of a wave front must be a characteristic. 

Let us consider first the simplest case, when the initial conditions refer to the 
hypersurface 2) = const., i.e. to an initial instant of time. From given values 
of p and db/dz, all first derivatives are obtainable by direct differentiation, 
and also the second derivatives of the form 


ea (i, k = 1, 2,3) (E.05) 
OL Oxy On OL 


(or rather the values of these quantities for fixed x9). As for the second deriva- 
tive with respect to Zp, it must be calculated from the wave equation. Explicitly 
the wave equation has the form 


feo Fe 0 (B.06) 


2 
Ox, 


43] 
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whére the dots denote terms containing the remaining second derivatives 
(E.05) and also the known first derivatives. Since by the nature of the metric 
tensor the quantity °° never vanishes, being always positive, equation (E.06) 
can always be solved for the remaining second derivative @%)/dr?. This means 
that if the variable x, has the character of a time, the Kypersurfaces x, = const. 
are not characteristics. 

Let us now consider the general case of a hypersurface (E.03). We introduce 
the variables 

1 =0G 2.2.0 )) 2.585 23s “ty (07) 

We denote by )’ the quantity considered as a function of the variables 
Z, %,, 2, and z,. Then we have 


nie. Reels (E.08) 
ox, 0x, OL, 


and since (E.04) holds, giving } and a/éx, for w = const. is equivalent to 
giving y’ and ap'/éx, for x =const. As before, these last quantities allow 
one to calculate all first derivatives and also the second derivatives of the 
form 


ary’ ay’ 
at aw? lant Ae , k = 1,2,3 . 
ana lm oe SCtC ae 
whereas the second derivative with respect to 2; must be determined from the 
wave equation. The d’Alembertian transformed to the new coordinates has the 
form 
A ay 
Uy = —~—; ma | (—9') 9% a] E.10 
where the g’** are obtained from the g** according to the usual tensor trans- 
formation rule ; in particular 
da, a’ Qu) Go 
Hy Oo pe ll 
Qn, Oa, 9 ax, Ox vo 
It is, however, to be remembered that since the new variable z, does not 
necessarily have the character of a time the inequality g’ > 0 need not hold. 
The transformed wave equation takes on the form 


(on So 2) OM ae i 25% (E.12) 
Ox, Ox,) Ox? — -"” 
where the terms omitted do not involve the second derivative with respect 
to x, but only second derivatives of the form (E.09) and also first derivatives. 
The values of all the omitted terms on the hypersurface w = const. may be 
taken to be known from the initial data relating to this hypersurface. 

The second derivative with respect to 2, remains indeterminate if and only 


if its coefficient in the wave equation (E.12) becomes zero, i.e. if the function w 
satisfies the equation 


g%=%g 


gy —— — = 0 (E.13) 


This is the equation of the characteristics for the wave equation (E.01). 
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The characteristics of the generalized wave equation are the same as the 
characteristics of the generally covariant Maxwell equations discussed in 
Section 46. Briefly, though not quite rigorously, this result may be justified 
by the following consideration. Under the condition V,A’ =0 Maxwell’s 
equations give rise to equations for the potentials A, in which the highest 
(second) derivatives are grouped in the form of the d’Alembertian. Hence it 
can be concluded that the characteristics of the generally covariant Maxwell 
equations have the form (E.13). The lack of rigour in this derivation consists 
in passing from the characteristics for the potentials to the characteristics for 
the field. It is not difficult to give a rigorous derivation in which one deals 
directly with the field components, similar to the procedure of Section 3 for 
cartesian coordinates in the Euclidean case. 


APPENDIX F 


INTEGRATION OF THE WAVE FRONT EQUATION 


If (Zo, Z1, Ly, Z3) = O (F.01) 
is the equation of a moving front then, as we know, the function w satisfies 
the partial differential equation 

goo, = 0 (F.02) 
where we have put, for brevity 


du) 
= — F.03 
Oa Fe (F.03) 
We consider the following problem : determine the form of the wave surface 
at an instant z) > 0 when its form is given at the initial instant 7, = 0. We 
solved a corresponding problem in Section 4 for the case of a Euclidean metric 
and Galilean coordinates ; we now consider it for the general case. 


Let the form of the wave surface at the initial instant be given by the equations 


av, = f,(u, 2) 
Lo = fo(u, v) for z,=0 (F.04) 
Ly = f,(u, v) 
These equations may be written in the more symmetric form 
t,=f.u,r);  («=0,1, 2, 3) (F.05) 


where 
fo =0 (F.06) 

Instead of initial data referring to the instant of time z) = 0 one can consider 
Cauchy data referring to some hypersurface; then the function f, will no 
longer be identically zero. 

If we insert the expressions (F.05) into the equation (F.01) we obtain an 
identity in « and v. Differentiating this identity with respect to u and to v 
we get the relations 


of. of. 
aS — 0 7 so — . 
Rpg ag oe 
If we adjoin to these the wave front equation (F.02) we can determine from these 
three homogeneous equations the initial values of the four quantities w,, «,, 
®, and w, apart from a common multiplier. Denoting these initial values by 
the superfix zero we have 
we = Ap, (u, ¥) ; A = A(u, v) (F.08) 
where the 9, are known functions and 2 is an arbitrary function of u and v. 
It should be noted that the ratios of the wo are not y ‘e uniquely given; 
they may have two values, because of the fact that the wave front equation 
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(F.02) is quadratic in the w,. For a unique determination of these ratios one 
must also indicate in which of the two possible directions the wave is travelling. 
To indicate that the left-hand sides of the equations (F.05) represent the 
initial values of the coordinates we attach the superfix zero to them, writing 
them as 
r= f,(u, 2) (F.09) 
We take a point on the initial wave surface (to each such point there corres- 
ponds a definite pair of values of u and v) and consider the ray passing through 
this point. As we saw in Section 38 the differential equations of the ray are the 
Hamiltonian equations corresponding to the Hamilton-Jacobi equation (F.02). 
Thus, according to (38.41), the equations have the form 


dp 2 ae, us (F.10) 


In Section 38 it was shown that these equations are equivalent to the equations 
of a null geodesic. 

For the ray passing through the point (w, v) the initial values of the variables 
Z, and w, are equal respectively to (F.09) and (F.08). Integrating equation 
(F.10) with the initial values 2° and w? we get 


= : 0 : 0 4) 29 «9 
Ly = IPs Lh Ly, Te, Ty Wh, Wy, 3, 05) 


F.11 
0, = O,Ps 7 8, th 285 68, of, of a) on 
Here we insert the initial values (F.09) and (F.08) and so obtain expressions 
of the form 


Le = FQ, U, v) (F.12) 
We — AD (Ap, u, v) 
Since the equations (F.10) do not change under the substitution 
©, =Awo,; pi’ =>Ap (F.13) 
where 2 is constant along the ray, the functions F, and ®, do not depend on 
x and p separately but only on their product dp. 
The expressions 
a, = Fp’, u, v) (F.14) 
give the equation of a moving wave surface in parametric form. By eliminating 
the variables p’, u and v one can obtain a relation between the four coordinates 
(2, 24, L_, Z) which is just the equation of a wave surface in the form (F.01) 
The proof of the relations and formulae mentioned here may be found in 
treatises on first order partial differential equations (e.g. see [8}). 
As the simplest example let us consider the equation 


1 
503 — (of + af + 08) = 0 (F.15) 


with the initial form of the wave surface given by 
z=f(z,y) for t=90 (F.16) 
Here the cartesian coordinates x and y play the part of the parameters wu and v. 


The equations (F.07) appear as 
o, + o,f, =9; Ws +a,f, =0 (F.17) 
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where f, and f, are the partial derivatives of f with respect to x and y. From 
(F.15) and (F.17) we find as one of the possible solutions 


Wg = Ac 
_ fz 
Or V+ FE +f) 
me _ Of (F.18) 
V+ f2 +f?) 
r 
aos 


VL +fe + Sp) 


The other solution is obtained from (F.18) by changing the sign of the square 
root. In equations (F.17) and (F.18) we should, strictly speaking, write w? 
instead of w,, but we omit the superfix zero because in this example the a, 
are constant anyhow. Solving the equations 


dig 1 day : : 
ap dp —GO)j 5 (2 => l, 2, 3) (F.19) 


aid assuming that at t = 9 we have also p = 0, we get 
Ly = —%o,. x, = 2 —w,p (* = 1, 2, 3) (F.20) 


Here we insert the w, from (F.18) and express the parameter p in terms of t. 
We then have for the first solution 


ctf, 
V(1 +f2 +f?) 


Ly = E+ 


pee (F.21) 
Ve ie) 


et 


For the second solution (with the opposite direction of propagation) we get 


ctf 
Ly =f — —___+ 
Vl fe 
rs = flz,y) + . 


VALE IE +S5) 
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In particular, if we take for the initial wave surface the surface of a sphere of 
radius a and put 


f(z, y) = (a? — «? — y?) (F.23) 
we get from (F.22) 
et 
Ly =f (3 + “| 
yo=yY (3 “) (F.24) 


t3 = a/ (a? —a—y?)-(1 +2) 
and, after eliminating z and y 


xt + 22 + a2 = (a + ct)? (F.25) 
i.e. the surface of a sphere of radius R = a + ct, as was to be expected. 


‘ APPENDIX G 


NECESSARY AND SUFFICIENT CONDITIONS 
FOR THE EUCLIDEAN CHARACTER 
OF THREE-DIMENSIONAL SPACE 


In three-dimensional space the second rank curvature tensor Ri, has the 
same number of components, namely six, as the fourth rank curvature tensor 
Rit,mx. Therefore one should expect that not only is Riz expressible in terms of 
Ru,mx by the general rule 


Ry = UR, me (G.01) 
but that also, conversely, Ru,mz can be expressed in terms of Ry, (we here 
omit the label a for components of a three-dimensional tensor). 

To find these expressions we introduce a system of antisymmetric quantities 


Ey, With e2, =1, similarly to our procedure in Section 22 and 37 and we 
construct an antisymmetric pseudo-tensor with the covariant components 


Egy = V4-E gy (G.02) 


and the contravariant components 


1 
Eth — ar Essp (G.03) 
For transformations of the coordinates x,, x, and z, that have a positive 
Jacobian 
z\ D(x, a2, x 
D( — Dy tp %) 9 (G.04) 
a D (2,, £4, 25) 
we have 


va'-D(* ) a5 (G.05) 


and therefore 


, OX, Ox, aa’ 
Ej, | 1 b= EF G.06 
igh (G.06) 
OX» OL, OL, ae 
by the rule for forming a determinant. This proves that in such transformations 
the quantities H,,, behave like a covariant tensor. By the rule for forming a 
determinant we can also write 


Ey,a'Pa*%a't = Ear (G.07) 
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where #?4° has the value (G.03). This proves that E?% is the contravariant 
pseudo-tensor corresponding to Eyx1:. Hence it is simple to obtain the relations 


Ey ata = Era, , (G.08) 

Ey,a" = EP, Aig (G.09) 
and 

Ex, = EPA, yg, (G.10) 


We note also the relation 
EPuk., = 878% — §P5¢ (G.11) 


which will be important for the subsequent discussion. It can be proved by the 
following reasoning. Both sides are different from zero only if p #q and 
r + s, and apart from their order the pair of numbers (p, g) must coincide with 
the pair (7, s). If p =r and gq =s, both sides are equal to +1 and if p=s 
and q = r both are equal to —1. Therefore both sides coincide for all possible 
values of the indices, so that equation (G.11) is proved. 

To find the. expression for Rz,mx in terms of Ry we introduce the contra- 
variant symmetric second rank tensor A? by the equations 


APE Ham = Ru, me (G.12) 


and then establish the relation between A?% and R?%. The equations (G.12) 
may be written in the form 


; = F 33 __ 
aA' = Ros, 233 aA? — Ry, 313 aA — Ryo, 12 


G.13 
aA*® = Ry, 12; aA3! = Ryo, 93; aAl? = Ros, 91 ( ) 
with 
AMS AY (G.14) 
These equations express the Apq directly in terms of the Ry, mx- 
Inserting (G.12) into (G.01) and using (G.09) and (G.11) we get 
Ry = APUaprdig — Apqtk) (G.15) 
and if we put 
G.16 
A =a,,A?4 (G.16) 
we have 
Riz = Az — anA (G.17) 
whence 
R= attRy, = —2A (G.18) 
and therefore 
An = Rin — tank (G.19) 


Thus Azz is simply the threc-dimensional conservative tensor. 
Inserting the corresponding contravariant tensor into (G.12) we obtain the 


required expression of the fourth rank tensor in terms of the second rank 


tensor : 
Riu. mg = (P27 — LaPIR)E pul gmk (G.20) 
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From the equations just obtained we can deduce the following important con- 
sequence. We know (Section 42) that the necessary and sufficient condition for 
the possibility of reducing a given quadratic form ds? to a form with constant 
coefficients is the vanishing for the fourth rank curvature tensor. This result 
obviously also applies to the purely spatial three-dimensional quadratic form 


dl? = ay, dx, dxx (G.21) 
for the reduction of which to the Euclidean form 
dl? = dx? + da? + dx? (G.22) 
the necessary and sufficient condition is the equation 
Ri, mk = 0 (G.23) 


where the quantity on the left-hand side is the three-dimensional tensor. 
But by (G.20) the fourth rank curvature tensor is, in three-dimensional space, 
expressible in terms of the second rank curvature tensor. Therefore the neces- 
sary and sufficient condition for the Euclidean character of a three-dimensional 
space is the vanishing of the second rank curvature tensor. 
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